Automatic cropping of video content

ABSTRACT

Electronic devices are often equipped with a camera for capturing video content and/or a display for displaying video content. However, amateur users often capture video content without regard to composition, framing, or camera movement, resulting in video content that can be jarring or confusing to viewers. There is a need to automate the processing and presentation of video content in an aesthetically pleasing manner. The embodiments described herein provide a method of automatically cropping video content for presentation on a display.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 16/282,856 (now U.S. Publication No. 2019-0191094; published on Jun. 20, 2019) filed Feb. 22, 2019, which is a continuation of U.S. application Ser. No. 14/850,677 (now U.S. Pat. No. 10,244,175; issued on Mar. 26, 2019) filed on Sep. 10, 2015, which claims the benefit of U.S. Provisional Application No. 62/130,311 filed on Mar. 9, 2015, the entire disclosures of which are herein incorporated by reference for all purposes.

FIELD OF THE DISCLOSURE

The present disclosure relates generally to methods of processing video, and more specifically to the automatic cropping of video content.

BACKGROUND OF THE DISCLOSURE

Electronic devices are often equipped with a camera for capturing video content and/or a display for displaying video content. However, amateur users often capture video content without regard to composition, framing, or camera movement, resulting in video content that can be jarring or confusing to viewers.

Furthermore, for amateur users, determining a desirable cropping presentation while capturing video content can impose a heavy cognitive burden and a substantial time commitment, making the task so unappealing as to be avoided. In addition, inefficiencies in determining a desirable cropping presentation can cause unnecessary power consumption in battery-powered devices.

SUMMARY OF THE DISCLOSURE

Electronic devices are often equipped with a camera for capturing video content and/or a display for displaying video content. However, amateur users often capture video content without regard to composition, framing, or camera movement, resulting in video content that can be jarring or confusing to viewers. There is a need to automate the processing and presentation of video content in an aesthetically pleasing manner. The embodiments described below provide a method of automatically cropping video content for presentation on a display.

BRIEF DESCRIPTION OF THE DRAWINGS

For a better understanding of the various described embodiments, reference should be made to the Detailed Description below, in conjunction with the following drawings, in which like reference numerals refer to corresponding parts throughout the figures.

FIG. 1A is a block diagram illustrating a multifunction device with a touch-sensitive display in accordance with some embodiments.

FIG. 1B is a block diagram illustrating exemplary components for event handling in accordance with some embodiments.

FIG. 2 illustrates a multifunction device having a touch screen in accordance with some embodiments.

FIG. 3 is a block diagram of an exemplary multifunction device with a display and a touch-sensitive surface in accordance with some embodiments.

FIG. 4 illustrates an exemplary user interface for a multifunction device with a touch-sensitive surface that is separate from the display in accordance with some embodiments.

FIG. 5 illustrates a block diagram of exemplary electronic devices in communication with each other in accordance with some embodiments.

FIGS. 6A-6LL illustrate exemplary cropping of video content in accordance with some embodiments.

FIGS. 7A-7D are flow diagrams illustrating a method for automatic cropping of video content in accordance with some embodiments.

FIG. 8 shows a functional block diagram of an electronic device configured in accordance with the principles of the various described embodiments, in accordance with some embodiments.

DETAILED DESCRIPTION

In the following description of examples, reference is made to the accompanying drawings, which form a part hereof, and in which specific examples that can be practiced are shown by way of illustration. It is to be understood that other examples can be used and structural changes can be made without departing from the scope of the disclosed examples.

Electronic devices are often equipped with a camera for capturing video content and/or a display for displaying video content. However, amateur users often capture video content without regard to composition, framing, or camera movement, resulting in video content that can be jarring or confusing to viewers. There is a need to automate the processing and presentation of video content in an aesthetically pleasing manner. The embodiments described below provide a method of automatically cropping video content for presentation on a display.

Exemplary Devices

Embodiments of electronic devices, user interfaces for such devices, and associated processes for using such devices are described. In some embodiments, the device is a portable communications device, such as a mobile telephone, that also contains other functions, such as PDA and/or music player functions. Exemplary embodiments of portable multifunction devices include, without limitation, the iPhone®, iPod Touch®, and iPad® devices from Apple Inc. of Cupertino, Calif. Other portable electronic devices, such as laptops or tablet computers with touch-sensitive surfaces (e.g., touch screen displays and/or touch pads), are, optionally, used. It should also be understood that, in some embodiments, the device is not a portable communications device, but is a desktop computer or a television with a touch-sensitive surface (e.g., a touch screen display and/or a touch pad). In some embodiments, the device does not have a touch screen display and/or a touch pad, but rather is capable of outputting display information (such as the user interfaces of the disclosure) for display on a separate display device, and capable of receiving input information from a separate input device having one or more input mechanisms (such as one or more buttons, a touch screen display and/or a touch pad). In some embodiments, the device has a display, but is capable of receiving input information from a separate input device having one or more input mechanisms (such as one or more buttons, a touch screen display and/or a touch pad).

In the discussion that follows, an electronic device that includes a display and a touch-sensitive surface is described. It should be understood, however, that the electronic device optionally includes one or more other physical user-interface devices, such as a physical keyboard, a mouse and/or a joystick. Further, as described above, it should be understood that the described electronic device, display and touch-sensitive surface are optionally distributed amongst two or more devices. Therefore, as used in this disclosure, information displayed on the electronic device or by the electronic device is optionally used to describe information outputted by the electronic device for display on a separate display device (touch-sensitive or not). Similarly, as used in this disclosure, input received on the electronic device (e.g., touch input received on a touch-sensitive surface of the electronic device) is optionally used to describe input received on a separate input device, from which the electronic device receives input information.

The device typically supports a variety of applications, such as one or more of the following: a drawing application, a presentation application, a word processing application, a website creation application, a disk authoring application, a spreadsheet application, a gaming application, a telephone application, a video conferencing application, an e-mail application, an instant messaging application, a workout support application, a photo management application, a digital camera application, a digital video camera application, a web browsing application, a digital music player application, a television channel browsing application, and/or a digital video player application.

The various applications that are executed on the device optionally use at least one common physical user-interface device, such as the touch-sensitive surface. One or more functions of the touch-sensitive surface as well as corresponding information displayed on the device are, optionally, adjusted and/or varied from one application to the next and/or within a respective application. In this way, a common physical architecture (such as the touch-sensitive surface) of the device optionally supports the variety of applications with user interfaces that are intuitive and transparent to the user.

Attention is now directed toward embodiments of portable or non-portable devices with touch-sensitive displays, though the devices need not include touch-sensitive displays or displays in general, as described above. FIG. 1A is a block diagram illustrating portable or non-portable multifunction device 100 with touch-sensitive displays 112 in accordance with some embodiments. Touch-sensitive display 112 is sometimes called a “touch screen” for convenience, and is sometimes known as or called a touch-sensitive display system. Device 100 includes memory 102 (which optionally includes one or more computer readable storage mediums), memory controller 122, one or more processing units (CPU's) 120, peripherals interface 118, RF circuitry 108, audio circuitry 110, speaker 111, microphone 113, input/output (I/O) subsystem 106, other input or control devices 116, and external port 124. Device 100 optionally includes one or more optical sensors 164. Device 100 optionally includes one or more intensity sensors 165 for detecting intensity of contacts on device 100 (e.g., a touch-sensitive surface such as touch-sensitive display system 112 of device 100). Device 100 optionally includes one or more tactile output generators 167 for generating tactile outputs on device 100 (e.g., generating tactile outputs on a touch-sensitive surface such as touch-sensitive display system 112 of device 100 or touchpad 355 of device 300). These components optionally communicate over one or more communication buses or signal lines 103.

As used in the specification and claims, the term “intensity” of a contact on a touch-sensitive surface refers to the force or pressure (force per unit area) of a contact (e.g., a finger contact) on the touch-sensitive surface, or to a substitute (proxy) for the force or pressure of a contact on the touch-sensitive surface. The intensity of a contact has a range of values that includes at least four distinct values and more typically includes hundreds of distinct values (e.g., at least 256). Intensity of a contact is, optionally, determined (or measured) using various approaches and various sensors or combinations of sensors. For example, one or more force sensors underneath or adjacent to the touch-sensitive surface are, optionally, used to measure force at various points on the touch-sensitive surface. In some implementations, force measurements from multiple force sensors are combined (e.g., a weighted average) to determine an estimated force of a contact. Similarly, a pressure-sensitive tip of a stylus is, optionally, used to determine a pressure of the stylus on the touch-sensitive surface. Alternatively, the size of the contact area detected on the touch-sensitive surface and/or changes thereto, the capacitance of the touch-sensitive surface proximate to the contact and/or changes thereto, and/or the resistance of the touch-sensitive surface proximate to the contact and/or changes thereto are, optionally, used as a substitute for the force or pressure of the contact on the touch-sensitive surface. In some implementations, the substitute measurements for contact force or pressure are used directly to determine whether an intensity threshold has been exceeded (e.g., the intensity threshold is described in units corresponding to the substitute measurements). In some implementations, the substitute measurements for contact force or pressure are converted to an estimated force or pressure and the estimated force or pressure is used to determine whether an intensity threshold has been exceeded (e.g., the intensity threshold is a pressure threshold measured in units of pressure).
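
Purely as an illustrative sketch (the disclosure does not prescribe any particular implementation, and all names below are hypothetical), the weighted-average combination of force-sensor readings and the threshold comparison described above could be expressed as follows:

    import Foundation

    // Hypothetical sketch: combining readings from several force sensors into a
    // single estimated contact force and testing it against an intensity threshold.
    struct ForceSensorReading {
        let force: Double   // force measured at one sensor location
        let weight: Double  // weighting, e.g., based on proximity to the contact
    }

    // Weighted average of the individual sensor readings.
    func estimatedContactForce(from readings: [ForceSensorReading]) -> Double {
        let totalWeight = readings.reduce(0) { $0 + $1.weight }
        guard totalWeight > 0 else { return 0 }
        let weightedSum = readings.reduce(0) { $0 + $1.force * $1.weight }
        return weightedSum / totalWeight
    }

    // True when the estimated force exceeds a given intensity threshold.
    func exceedsIntensityThreshold(_ readings: [ForceSensorReading],
                                   threshold: Double) -> Bool {
        return estimatedContactForce(from: readings) > threshold
    }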

As used in the specification and claims, the term “tactile output” refers to physical displacement of a device relative to a previous position of the device, physical displacement of a component (e.g., a touch-sensitive surface) of a device relative to another component (e.g., housing) of the device, or displacement of the component relative to a center of mass of the device that will be detected by a user with the user's sense of touch. For example, in situations where the device or the component of the device is in contact with a surface of a user that is sensitive to touch (e.g., a finger, palm, or other part of a user's hand), the tactile output generated by the physical displacement will be interpreted by the user as a tactile sensation corresponding to a perceived change in physical characteristics of the device or the component of the device. For example, movement of a touch-sensitive surface (e.g., a touch-sensitive display or trackpad) is, optionally, interpreted by the user as a “down click” or “up click” of a physical actuator button. In some cases, a user will feel a tactile sensation such as a “down click” or “up click” even when there is no movement of a physical actuator button associated with the touch-sensitive surface that is physically pressed (e.g., displaced) by the user's movements. As another example, movement of the touch-sensitive surface is, optionally, interpreted or sensed by the user as “roughness” of the touch-sensitive surface, even when there is no change in smoothness of the touch-sensitive surface. While such interpretations of touch by a user will be subject to the individualized sensory perceptions of the user, there are many sensory perceptions of touch that are common to a large majority of users. Thus, when a tactile output is described as corresponding to a particular sensory perception of a user (e.g., an “up click,” a “down click,” “roughness”), unless otherwise stated, the generated tactile output corresponds to physical displacement of the device or a component thereof that will generate the described sensory perception for a typical (or average) user.

It should be appreciated that device 100 is only one example of a portable or non-portable multifunction device, and that device 100 optionally has more or fewer components than shown, optionally combines two or more components, or optionally has a different configuration or arrangement of the components. The various components shown in FIG. 1A are implemented in hardware, software, or a combination of both hardware and software, including one or more signal processing and/or application specific integrated circuits. Further, the various components shown in FIG. 1A are optionally implemented across two or more devices; for example, a display and audio circuitry on a display device, a touch-sensitive surface on an input device, and remaining components on device 100. In such an embodiment, device 100 optionally communicates with the display device and/or the input device to facilitate operation of the system, as described in the disclosure, and the various components described herein that relate to display and/or input remain in device 100, or are optionally included in the display and/or input device, as appropriate.

Memory 102 optionally includes high-speed random access memory and optionally also includes non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other non-volatile solid-state memory devices. Access to memory 102 by other components of device 100, such as CPU 120 and the peripherals interface 118, is, optionally, controlled by memory controller 122.

Peripherals interface 118 can be used to couple input and output peripherals of the device to CPU 120 and memory 102. The one or more processors 120 run or execute various software programs and/or sets of instructions stored in memory 102 to perform various functions for device 100 and to process data.

In some embodiments, peripherals interface 118, CPU 120, and memory controller 122 are, optionally, implemented on a single chip, such as chip 104. In some other embodiments, they are, optionally, implemented on separate chips.

RF (radio frequency) circuitry 108 receives and sends RF signals, also called electromagnetic signals. RF circuitry 108 converts electrical signals to/from electromagnetic signals and communicates with communications networks and other communications devices via the electromagnetic signals. RF circuitry 108 optionally includes well-known circuitry for performing these functions, including but not limited to an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC chipset, a subscriber identity module (SIM) card, memory, and so forth. RF circuitry 108 optionally communicates with networks, such as the Internet, also referred to as the World Wide Web (WWW), an intranet and/or a wireless network, such as a cellular telephone network, a wireless local area network (LAN) and/or a metropolitan area network (MAN), and other devices by wireless communication. The wireless communication optionally uses any of a plurality of communications standards, protocols and technologies, including but not limited to Global System for Mobile Communications (GSM), Enhanced Data GSM Environment (EDGE), high-speed downlink packet access (HSDPA), high-speed uplink packet access (HSUPA), Evolution, Data-Only (EV-DO), HSPA, HSPA+, Dual-Cell HSPA (DC-HSPDA), long term evolution (LTE), near field communication (NFC), wideband code division multiple access (W-CDMA), code division multiple access (CDMA), time division multiple access (TDMA), Bluetooth, Wireless Fidelity (Wi-Fi) (e.g., IEEE 802.11a, IEEE 802.11b, IEEE 802.11g and/or IEEE 802.11n), voice over Internet Protocol (VoIP), Wi-MAX, a protocol for e-mail (e.g., Internet message access protocol (IMAP) and/or post office protocol (POP)), instant messaging (e.g., extensible messaging and presence protocol (XMPP), Session Initiation Protocol for Instant Messaging and Presence Leveraging Extensions (SIMPLE), Instant Messaging and Presence Service (IMPS)), and/or Short Message Service (SMS), or any other suitable communication protocol, including communication protocols not yet developed as of the filing date of this document.

Audio circuitry 110, speaker 111, and microphone 113 provide an audio interface between a user and device 100. Audio circuitry 110 receives audio data from peripherals interface 118, converts the audio data to an electrical signal, and transmits the electrical signal to speaker 111. Speaker 111 converts the electrical signal to human-audible sound waves. Audio circuitry 110 also receives electrical signals converted by microphone 113 from sound waves. Audio circuitry 110 converts the electrical signal to audio data and transmits the audio data to peripherals interface 118 for processing. Audio data is, optionally, retrieved from and/or transmitted to memory 102 and/or RF circuitry 108 by peripherals interface 118. In some embodiments, audio circuitry 110 also includes a headset jack (e.g., 212, FIG. 2). The headset jack provides an interface between audio circuitry 110 and removable audio input/output peripherals, such as output-only headphones or a headset with both output (e.g., a headphone for one or both ears) and input (e.g., a microphone).

I/O subsystem 106 couples input/output peripherals on device 100, such as touch screen 112 and other input control devices 116, to peripherals interface 118. I/O subsystem 106 optionally includes display controller 156, optical sensor controller 158, intensity sensor controller 159, haptic feedback controller 161 and one or more input controllers 160 for other input or control devices. The one or more input controllers 160 receive/send electrical signals from/to other input or control devices 116. The other input control devices 116 optionally include physical buttons (e.g., push buttons, rocker buttons, etc.), dials, slider switches, joysticks, click wheels, and so forth. In some alternate embodiments, input controller(s) 160 are, optionally, coupled to any (or none) of the following: a keyboard, infrared port, USB port, and a pointer device such as a mouse. The one or more buttons (e.g., 208, FIG. 2) optionally include an up/down button for volume control of speaker 111 and/or microphone 113. The one or more buttons optionally include a push button (e.g., 206, FIG. 2).

Touch-sensitive display 112 provides an input interface and an output interface between the device and a user. As described above, the touch-sensitive operation and the display operation of touch-sensitive display 112 are optionally separated from each other, such that a display device is used for display purposes and a touch-sensitive surface (whether display or not) is used for input detection purposes, and the described components and functions are modified accordingly. However, for simplicity, the following description is provided with reference to a touch-sensitive display. Display controller 156 receives and/or sends electrical signals from/to touch screen 112. Touch screen 112 displays visual output to the user. The visual output optionally includes graphics, text, icons, video, and any combination thereof (collectively termed “graphics”). In some embodiments, some or all of the visual output corresponds to user-interface objects.

Touch screen 112 has a touch-sensitive surface, sensor or set of sensors that accepts input from the user based on haptic and/or tactile contact. Touch screen 112 and display controller 156 (along with any associated modules and/or sets of instructions in memory 102) detect contact (and any movement or breaking of the contact) on touch screen 112 and convert the detected contact into interaction with user-interface objects (e.g., one or more soft keys, icons, web pages or images) that are displayed on touch screen 112. In an exemplary embodiment, a point of contact between touch screen 112 and the user corresponds to a finger of the user.

Touch screen 112 optionally uses LCD (liquid crystal display) technology, LPD (light emitting polymer display) technology, or LED (light emitting diode) technology, although other display technologies are used in other embodiments. Touch screen 112 and display controller 156 optionally detect contact and any movement or breaking thereof using any of a plurality of touch sensing technologies now known or later developed, including but not limited to capacitive, resistive, infrared, and surface acoustic wave technologies, as well as other proximity sensor arrays or other elements for determining one or more points of contact with touch screen 112. In an exemplary embodiment, projected mutual capacitance sensing technology is used, such as that found in the iPhone®, iPod Touch®, and iPad® from Apple Inc. of Cupertino, Calif.

Touch screen 112 optionally has a video resolution in excess of 100 dpi. In some embodiments, the touch screen has a video resolution of approximately 160 dpi. The user optionally makes contact with touch screen 112 using any suitable object or appendage, such as a stylus, a finger, and so forth. In some embodiments, the user interface is designed to work primarily with finger-based contacts and gestures, which can be less precise than stylus-based input due to the larger area of contact of a finger on the touch screen. In some embodiments, the device translates the rough finger-based input into a precise pointer/cursor position or command for performing the actions desired by the user.

In some embodiments, in addition to the touch screen, device 100 optionally includes a touchpad (not shown) for activating or deactivating particular functions. In some embodiments, the touchpad is a touch-sensitive area of the device that, unlike the touch screen, does not display visual output. The touchpad is, optionally, a touch-sensitive surface that is separate from touch screen 112 or an extension of the touch-sensitive surface formed by the touch screen.

Device 100 also includes power system 162 for powering the various components. Power system 162 optionally includes a power management system, one or more power sources (e.g., battery, alternating current (AC)), a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator (e.g., a light-emitting diode (LED)) and any other components associated with the generation, management and distribution of power in portable or non-portable devices.

Device 100 optionally also includes one or more optical sensors 164. FIG. 1A shows an optical sensor coupled to optical sensor controller 158 in I/O subsystem 106. Optical sensor 164 optionally includes charge-coupled device (CCD) or complementary metal-oxide semiconductor (CMOS) phototransistors. Optical sensor 164 receives light from the environment, projected through one or more lenses, and converts the light to data representing an image. In conjunction with imaging module 143 (also called a camera module), optical sensor 164 optionally captures still images or video. In some embodiments, an optical sensor is located on the back of device 100, opposite touch screen display 112 on the front of the device, so that the touch screen display is enabled for use as a viewfinder for still and/or video image acquisition. In some embodiments, another optical sensor is located on the front of the device so that the user's image is, optionally, obtained for videoconferencing while the user views the other video conference participants on the touch screen display.

Device 100 optionally also includes one or more contact intensity sensors 165. FIG. 1A shows a contact intensity sensor coupled to intensity sensor controller 159 in I/O subsystem 106. Contact intensity sensor 165 optionally includes one or more piezoresistive strain gauges, capacitive force sensors, electric force sensors, piezoelectric force sensors, optical force sensors, capacitive touch-sensitive surfaces, or other intensity sensors (e.g., sensors used to measure the force (or pressure) of a contact on a touch-sensitive surface). Contact intensity sensor 165 receives contact intensity information (e.g., pressure information or a proxy for pressure information) from the environment. In some embodiments, at least one contact intensity sensor is collocated with, or proximate to, a touch-sensitive surface (e.g., touch-sensitive display system 112). In some embodiments, at least one contact intensity sensor is located on the back of device 100, opposite touch screen display 112 which is located on the front of device 100.

Device 100 optionally also includes one or more proximity sensors 166. FIG. 1A shows proximity sensor 166 coupled to peripherals interface 118. Alternately, proximity sensor 166 is coupled to input controller 160 in I/O subsystem 106. In some embodiments, the proximity sensor turns off and disables touch screen 112 when the multifunction device is placed near the user's ear (e.g., when the user is making a phone call).

Device 100 optionally also includes one or more tactile output generators 167. FIG. 1A shows a tactile output generator coupled to haptic feedback controller 161 in I/O subsystem 106. Tactile output generator 167 optionally includes one or more electroacoustic devices such as speakers or other audio components and/or electromechanical devices that convert energy into linear motion such as a motor, solenoid, electroactive polymer, piezoelectric actuator, electrostatic actuator, or other tactile output generating component (e.g., a component that converts electrical signals into tactile outputs on the device). Tactile output generator 167 receives tactile feedback generation instructions from haptic feedback module 133 and generates tactile outputs on device 100 that are capable of being sensed by a user of device 100. In some embodiments, at least one tactile output generator is collocated with, or proximate to, a touch-sensitive surface (e.g., touch-sensitive display system 112) and, optionally, generates a tactile output by moving the touch-sensitive surface vertically (e.g., in/out of a surface of device 100) or laterally (e.g., back and forth in the same plane as a surface of device 100). In some embodiments, at least one tactile output generator sensor is located on the back of device 100, opposite touch screen display 112 which is located on the front of device 100.

Device 100 optionally also includes one or more accelerometers 168. FIG. 1A shows accelerometer 168 coupled to peripherals interface 118. Alternately, accelerometer 168 is, optionally, coupled to an input controller 160 in I/O subsystem 106. In some embodiments, information is displayed on the touch screen display in a portrait view or a landscape view based on an analysis of data received from the one or more accelerometers. Device 100 optionally includes, in addition to accelerometer(s) 168, a magnetometer (not shown) and a GPS (or GLONASS or other global navigation system) receiver (not shown) for obtaining information concerning the location and orientation (e.g., portrait or landscape) of device 100.
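
As a minimal sketch of the kind of analysis mentioned above (hypothetical names; the disclosure leaves the analysis unspecified), a portrait or landscape presentation can be chosen by comparing the gravity components reported by the accelerometer along the screen axes:

    import Foundation

    enum DisplayOrientation { case portrait, landscape }

    // x and y are gravity components along the display's short and long axes.
    // When gravity acts mostly along the long (y) axis, the device is upright.
    func orientation(gravityX x: Double, gravityY y: Double) -> DisplayOrientation {
        return abs(y) >= abs(x) ? .portrait : .landscape
    }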

In some embodiments, the software components stored in memory 102 include operating system 126, communication module (or set of instructions) 128, contact/motion module (or set of instructions) 130, graphics module (or set of instructions) 132, text input module (or set of instructions) 134, Global Positioning System (GPS) module (or set of instructions) 135, and applications (or sets of instructions) 136. Furthermore, in some embodiments memory 102 stores device/global internal state 157, as shown in FIGS. 1A and 3. Device/global internal state 157 includes one or more of: active application state, indicating which applications, if any, are currently active; display state, indicating what applications, views or other information occupy various regions of touch screen display 112; sensor state, including information obtained from the device's various sensors and input control devices 116; and location information concerning the device's location and/or attitude.
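
For illustration only (field names and types are hypothetical), device/global internal state 157 can be pictured as a record aggregating the kinds of information listed above:

    import CoreGraphics

    struct DeviceGlobalInternalState {
        var activeApplications: [String]             // active application state
        var displayRegions: [String: CGRect]         // which views occupy which regions of the display
        var sensorReadings: [String: Double]         // latest values from sensors and input control devices
        var location: (latitude: Double, longitude: Double)?  // device location, if available
        var attitude: Double?                        // device attitude (e.g., rotation about an axis)
    }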

Operating system 126 (e.g., Darwin, RTXC, LINUX, UNIX, OS X, WINDOWS, or an embedded operating system such as VxWorks) includes various software components and/or drivers for controlling and managing general system tasks (e.g., memory management, storage device control, power management, etc.) and facilitates communication between various hardware and software components.

Communication module 128 facilitates communication with other devices over one or more external ports 124 and also includes various software components for handling data received by RF circuitry 108 and/or external port 124. External port 124 (e.g., Universal Serial Bus (USB), FIREWIRE, etc.) is adapted for coupling directly to other devices or indirectly over a network (e.g., the Internet, wireless LAN, etc.). In some embodiments, the external port is a multi-pin (e.g., 30-pin) connector that is the same as, or similar to and/or compatible with the 30-pin connector used on iPod (trademark of Apple Inc.) devices.

Contact/motion module 130 optionally detects contact with touch screen 112 (in conjunction with display controller 156) and other touch-sensitive devices (e.g., a touchpad or physical click wheel). Contact/motion module 130 includes various software components for performing various operations related to detection of contact, such as determining if contact has occurred (e.g., detecting a finger-down event), determining an intensity of the contact (e.g., the force or pressure of the contact or a substitute for the force or pressure of the contact), determining if there is movement of the contact and tracking the movement across the touch-sensitive surface (e.g., detecting one or more finger-dragging events), and determining if the contact has ceased (e.g., detecting a finger-up event or a break in contact). Contact/motion module 130 receives contact data from the touch-sensitive surface. Determining movement of the point of contact, which is represented by a series of contact data, optionally includes determining speed (magnitude), velocity (magnitude and direction), and/or an acceleration (a change in magnitude and/or direction) of the point of contact. These operations are, optionally, applied to single contacts (e.g., one finger contacts) or to multiple simultaneous contacts (e.g., “multitouch”/multiple finger contacts). In some embodiments, contact/motion module 130 and display controller 156 detect contact on a touchpad.
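
As an illustrative sketch of the speed and velocity determination described above (hypothetical types; not the module's actual code), velocity can be derived from consecutive timestamped contact samples:

    import CoreGraphics
    import Foundation

    struct ContactSample {
        let position: CGPoint
        let timestamp: TimeInterval
    }

    // Velocity (points per second along x and y) between two consecutive samples.
    func velocity(from a: ContactSample, to b: ContactSample) -> CGVector {
        let dt = b.timestamp - a.timestamp
        guard dt > 0 else { return .zero }
        return CGVector(dx: (b.position.x - a.position.x) / CGFloat(dt),
                        dy: (b.position.y - a.position.y) / CGFloat(dt))
    }

    // Speed is the magnitude of the velocity vector.
    func speed(of v: CGVector) -> CGFloat {
        return (v.dx * v.dx + v.dy * v.dy).squareRoot()
    }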

In some embodiments, contact/motion module 130 uses a set of one or more intensity thresholds to determine whether an operation has been performed by a user (e.g., to determine whether a user has “clicked” on an icon). In some embodiments, at least a subset of the intensity thresholds are determined in accordance with software parameters (e.g., the intensity thresholds are not determined by the activation thresholds of particular physical actuators and can be adjusted without changing the physical hardware of device 100). For example, a mouse “click” threshold of a trackpad or touch screen display can be set to any of a large range of predefined threshold values without changing the trackpad or touch screen display hardware. Additionally, in some implementations a user of the device is provided with software settings for adjusting one or more of the set of intensity thresholds (e.g., by adjusting individual intensity thresholds and/or by adjusting a plurality of intensity thresholds at once with a system-level click “intensity” parameter).
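
A minimal sketch of such software-defined thresholds (hypothetical names and values) might keep them as ordinary parameters that a system-level setting can scale at once:

    struct IntensityThresholds {
        var lightPress: Double = 0.25   // normalized intensity treated as a "click"
        var deepPress: Double = 0.75    // higher threshold for a deeper press

        // A system-level click "intensity" setting adjusts every threshold at once.
        mutating func applySystemClickIntensity(_ scale: Double) {
            lightPress *= scale
            deepPress *= scale
        }
    }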

Contact/motion module 130 optionally detects a gesture input by a user. Different gestures on the touch-sensitive surface have different contact patterns and intensities. Thus, a gesture is, optionally, detected by detecting a particular contact pattern. For example, detecting a finger tap gesture includes detecting a finger-down event followed by detecting a finger-up (lift off) event at the same position (or substantially the same position) as the finger-down event (e.g., at the position of an icon). As another example, detecting a finger swipe gesture on the touch-sensitive surface includes detecting a finger-down event followed by detecting one or more finger-dragging events, and subsequently followed by detecting a finger-up (lift off) event.
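
For illustration (hypothetical types; the module's actual detection logic is not specified here), the tap and swipe patterns described above can be distinguished from a recorded sequence of sub-events:

    import CoreGraphics

    enum TouchSubEvent {
        case fingerDown(CGPoint)
        case fingerDrag(CGPoint)
        case fingerUp(CGPoint)
    }

    enum Gesture { case tap, swipe, unknown }

    // A tap is finger-down followed by finger-up at substantially the same position;
    // a swipe is finger-down, one or more finger-dragging events, then finger-up.
    func classify(_ events: [TouchSubEvent], tolerance: CGFloat = 10) -> Gesture {
        guard case .fingerDown(let start)? = events.first,
              case .fingerUp(let end)? = events.last else { return .unknown }
        let dragCount = events.dropFirst().dropLast().filter { event in
            if case .fingerDrag = event { return true }
            return false
        }.count
        let dx = end.x - start.x
        let dy = end.y - start.y
        let distance = (dx * dx + dy * dy).squareRoot()
        if dragCount == 0 && distance <= tolerance { return .tap }
        if dragCount > 0 { return .swipe }
        return .unknown
    }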

Graphics module 132 includes various known software components for rendering and displaying graphics on touch screen 112 or other display, including components for changing the visual impact (e.g., brightness, transparency, saturation, contrast or other visual property) of graphics that are displayed. As used herein, the term “graphics” includes any object that can be displayed to a user, including without limitation text, web pages, icons (such as user-interface objects including soft keys), digital images, videos, animations and the like.

In some embodiments, graphics module 132 stores data representing graphics to be used. Each graphic is, optionally, assigned a corresponding code. Graphics module 132 receives, from applications etc., one or more codes specifying graphics to be displayed along with, if necessary, coordinate data and other graphic property data, and then generates screen image data to output to display controller 156.

Haptic feedback module 133 includes various software components for generating instructions used by tactile output generator(s) 167 to produce tactile outputs at one or more locations on device 100 in response to user interactions with device 100.

Text input module 134, which is, optionally, a component of graphics module 132, provides soft keyboards for entering text in various applications (e.g., contacts 137, e-mail 140, IM 141, browser 147, and any other application that needs text input).

GPS module 135 determines the location of the device and provides this information for use in various applications (e.g., to telephone 138 for use in location-based dialing, to camera 143 as picture/video metadata, and to applications that provide location-based services such as weather widgets, local yellow page widgets, and map/navigation widgets).

Applications 136 optionally include the following modules (or sets of instructions), or a subset or superset thereof:

-   contacts module 137 (sometimes called an address book or contact list);
-   telephone module 138;
-   video conferencing module 139;
-   e-mail client module 140;
-   instant messaging (IM) module 141;
-   workout support module 142;
-   camera module 143 for still and/or video images;
-   image management module 144;
-   browser module 147;
-   calendar module 148;
-   widget modules 149, which optionally include one or more of: weather widget 149-1, stocks widget 149-2, calculator widget 149-3, alarm clock widget 149-4, dictionary widget 149-5, and other widgets obtained by the user, as well as user-created widgets 149-6;
-   widget creator module 150 for making user-created widgets 149-6;
-   search module 151;
-   video and music player module 152, which is, optionally, made up of a video player module and a music player module;
-   notes module 153;
-   map module 154;
-   online video module 155.

Examples of other applications 136 that are, optionally, stored in memory 102 include other word processing applications, other image editing applications, drawing applications, presentation applications, JAVA-enabled applications, encryption, digital rights management, voice recognition, and voice replication.

In conjunction with touch screen 112, display controller 156, contact module 130, graphics module 132, and text input module 134, contacts module 137 is, optionally, used to manage an address book or contact list (e.g., stored in application internal state 192 of contacts module 137 in memory 102 or memory 370), including: adding name(s) to the address book; deleting name(s) from the address book; associating telephone number(s), e-mail address(es), physical address(es) or other information with a name; associating an image with a name; categorizing and sorting names; providing telephone numbers or e-mail addresses to initiate and/or facilitate communications by telephone 138, video conference 139, e-mail 140, or IM 141; and so forth.

In conjunction with RF circuitry 108, audio circuitry 110, speaker 111, microphone 113, touch screen 112, display controller 156, contact module 130, graphics module 132, and text input module 134, telephone module 138 is, optionally, used to enter a sequence of characters corresponding to a telephone number, access one or more telephone numbers in address book 137, modify a telephone number that has been entered, dial a respective telephone number, conduct a conversation, and disconnect or hang up when the conversation is completed. As noted above, the wireless communication optionally uses any of a plurality of communications standards, protocols and technologies.

In conjunction with RF circuitry 108, audio circuitry 110, speaker 111, microphone 113, touch screen 112, display controller 156, optical sensor 164, optical sensor controller 158, contact module 130, graphics module 132, text input module 134, contact list 137, and telephone module 138, videoconferencing module 139 includes executable instructions to initiate, conduct, and terminate a video conference between a user and one or more other participants in accordance with user instructions.

In conjunction with RF circuitry 108, touch screen 112, display controller 156, contact module 130, graphics module 132, and text input module 134, e-mail client module 140 includes executable instructions to create, send, receive, and manage e-mail in response to user instructions. In conjunction with image management module 144, e-mail client module 140 makes it very easy to create and send e-mails with still or video images taken with camera module 143.

In conjunction with RF circuitry 108, touch screen 112, display controller 156, contact module 130, graphics module 132, and text input module 134, the instant messaging module 141 includes executable instructions to enter a sequence of characters corresponding to an instant message, to modify previously entered characters, to transmit a respective instant message (for example, using a Short Message Service (SMS) or Multimedia Message Service (MMS) protocol for telephony-based instant messages or using XMPP, SIMPLE, or IMPS for Internet-based instant messages), to receive instant messages, and to view received instant messages. In some embodiments, transmitted and/or received instant messages optionally include graphics, photos, audio files, video files and/or other attachments as are supported in an MMS and/or an Enhanced Messaging Service (EMS). As used herein, “instant messaging” refers to both telephony-based messages (e.g., messages sent using SMS or MMS) and Internet-based messages (e.g., messages sent using XMPP, SIMPLE, or IMPS).

In conjunction with RF circuitry 108, touch screen 112, display controller 156, contact module 130, graphics module 132, text input module 134, GPS module 135, map module 154, and music player module 146, workout support module 142 includes executable instructions to create workouts (e.g., with time, distance, and/or calorie burning goals); communicate with workout sensors (sports devices); receive workout sensor data; calibrate sensors used to monitor a workout; select and play music for a workout; and display, store and transmit workout data.

In conjunction with touch screen 112, display controller 156, optical sensor(s) 164, optical sensor controller 158, contact module 130, graphics module 132, and image management module 144, camera module 143 includes executable instructions to capture still images or video (including a video stream) and store them into memory 102, modify characteristics of a still image or video, or delete a still image or video from memory 102.

In conjunction with touch screen 112, display controller 156, contact module 130, graphics module 132, text input module 134, and camera module 143, image management module 144 includes executable instructions to arrange, modify (e.g., edit), or otherwise manipulate, label, delete, present (e.g., in a digital slide show or album), and store still and/or video images.

In conjunction with RF circuitry 108, touch screen 112, display system controller 156, contact module 130, graphics module 132, and text input module 134, browser module 147 includes executable instructions to browse the Internet in accordance with user instructions, including searching, linking to, receiving, and displaying web pages or portions thereof, as well as attachments and other files linked to web pages.

In conjunction with RF circuitry 108, touch screen 112, display system controller 156, contact module 130, graphics module 132, text input module 134, e-mail client module 140, and browser module 147, calendar module 148 includes executable instructions to create, display, modify, and store calendars and data associated with calendars (e.g., calendar entries, to do lists, etc.) in accordance with user instructions.

In conjunction with RF circuitry 108, touch screen 112, display system controller 156, contact module 130, graphics module 132, text input module 134, and browser module 147, widget modules 149 are mini-applications that are, optionally, downloaded and used by a user (e.g., weather widget 149-1, stocks widget 149-2, calculator widget 149-3, alarm clock widget 149-4, and dictionary widget 149-5) or created by the user (e.g., user-created widget 149-6). In some embodiments, a widget includes an HTML (Hypertext Markup Language) file, a CSS (Cascading Style Sheets) file, and a JavaScript file. In some embodiments, a widget includes an XML (Extensible Markup Language) file and a JavaScript file (e.g., Yahoo! Widgets).

In conjunction with RF circuitry 108, touch screen 112, display system controller 156, contact module 130, graphics module 132, text input module 134, and browser module 147, the widget creator module 150 is, optionally, used by a user to create widgets (e.g., turning a user-specified portion of a web page into a widget).

In conjunction with touch screen 112, display system controller 156, contact module 130, graphics module 132, and text input module 134, search module 151 includes executable instructions to search for text, music, sound, image, video, and/or other files in memory 102 that match one or more search criteria (e.g., one or more user-specified search terms) in accordance with user instructions.

In conjunction with touch screen 112, display system controller 156, contact module 130, graphics module 132, audio circuitry 110, speaker 111, RF circuitry 108, and browser module 147, video and music player module 152 includes executable instructions that allow the user to download and play back recorded music and other sound files stored in one or more file formats, such as MP3 or AAC files, and executable instructions to display, present or otherwise play back videos (e.g., on touch screen 112 or on an external, connected display via external port 124). In some embodiments, device 100 optionally includes the functionality of an MP3 player, such as an iPod (trademark of Apple Inc.).

In conjunction with touch screen 112, display controller 156, contact module 130, graphics module 132, and text input module 134, notes module 153 includes executable instructions to create and manage notes, to do lists, and the like in accordance with user instructions.

In conjunction with RF circuitry 108, touch screen 112, display system controller 156, contact module 130, graphics module 132, text input module 134, GPS module 135, and browser module 147, map module 154 is, optionally, used to receive, display, modify, and store maps and data associated with maps (e.g., driving directions; data on stores and other points of interest at or near a particular location; and other location-based data) in accordance with user instructions.

In conjunction with touch screen 112, display system controller 156, contact module 130, graphics module 132, audio circuitry 110, speaker 111, RF circuitry 108, text input module 134, e-mail client module 140, and browser module 147, online video module 155 includes instructions that allow the user to access, browse, receive (e.g., by streaming and/or download), play back (e.g., on the touch screen or on an external, connected display via external port 124), send an e-mail with a link to a particular online video, and otherwise manage online videos in one or more file formats, such as H.264. In some embodiments, instant messaging module 141, rather than e-mail client module 140, is used to send a link to a particular online video.

Each of the above identified modules and applications corresponds to a set of executable instructions for performing one or more functions described above and the methods described in this application (e.g., the computer-implemented methods and other information processing methods described herein). These modules (i.e., sets of instructions) need not be implemented as separate software programs, procedures or modules, and thus various subsets of these modules are, optionally, combined or otherwise re-arranged in various embodiments. In some embodiments, memory 102 optionally stores a subset of the modules and data structures identified above. Furthermore, memory 102 optionally stores additional modules and data structures not described above.

In some embodiments, device 100 is a device where operation of a predefined set of functions on the device is performed exclusively through a touch screen and/or a touchpad (whether included in device 100 or on a separate device, such as an input device). By using a touch screen and/or a touchpad as the primary input control device for operation of device 100, the number of physical input control devices (such as push buttons, dials, and the like) on device 100 is, optionally, reduced.

The predefined set of functions that are performed exclusively through a touch screen and/or a touchpad optionally include navigation between user interfaces. In some embodiments, the touchpad, when touched by the user, navigates device 100 to a main, home, or root menu from any user interface that is displayed on device 100. In such embodiments, a “menu button” is implemented using a touchpad. In some other embodiments, the menu button is a physical push button or other physical input control device instead of a touchpad.

FIG. 1B is a block diagram illustrating exemplary components for event handling in accordance with some embodiments. In some embodiments, memory 102 (in FIG. 1A) or 370 (FIG. 3) includes event sorter 170 (e.g., in operating system 126) and a respective application 136-1 (e.g., any of the aforementioned applications 137-151, 155, 380-390).

Event sorter 170 receives event information and determines the application 136-1 and application view 191 of application 136-1 to which to deliver the event information. Event sorter 170 includes event monitor 171 and event dispatcher module 174. In some embodiments, application 136-1 includes application internal state 192, which indicates the current application view(s) displayed on touch-sensitive display 112 when the application is active or executing. In some embodiments, device/global internal state 157 is used by event sorter 170 to determine which application(s) is (are) currently active, and application internal state 192 is used by event sorter 170 to determine application views 191 to which to deliver event information.

In some embodiments, application internal state 192 includes additional information, such as one or more of: resume information to be used when application 136-1 resumes execution, user interface state information that indicates information being displayed or that is ready for display by application 136-1, a state queue for enabling the user to go back to a prior state or view of application 136-1, and a redo/undo queue of previous actions taken by the user.

Event monitor 171 receives event information from peripherals interface 118. Event information includes information about a sub-event (e.g., a user touch on touch-sensitive display 112, as part of a multi-touch gesture). Peripherals interface 118 transmits information it receives from I/O subsystem 106 or a sensor, such as proximity sensor 166, accelerometer(s) 168, and/or microphone 113 (through audio circuitry 110). Information that peripherals interface 118 receives from I/O subsystem 106 includes information from touch-sensitive display 112 or a touch-sensitive surface.

In some embodiments, event monitor 171 sends requests to the peripherals interface 118 at predetermined intervals. In response, peripherals interface 118 transmits event information. In other embodiments, peripherals interface 118 transmits event information only when there is a significant event (e.g., receiving an input above a predetermined noise threshold and/or for more than a predetermined duration).

In some embodiments, event sorter 170 also includes a hit view determination module 172 and/or an active event recognizer determination module 173.

Hit view determination module 172 provides software procedures for determining where a sub-event has taken place within one or more views, when touch-sensitive display 112 displays more than one view. Views are made up of controls and other elements that a user can see on the display.

Another aspect of the user interface associated with an application is a set of views, sometimes herein called application views or user interface windows, in which information is displayed and touch-based gestures occur. The application views (of a respective application) in which a touch is detected optionally correspond to programmatic levels within a programmatic or view hierarchy of the application. For example, the lowest level view in which a touch is detected is, optionally, called the hit view, and the set of events that are recognized as proper inputs are, optionally, determined based, at least in part, on the hit view of the initial touch that begins a touch-based gesture.

Hit view determination module 172 receives information related to sub-events of a touch-based gesture. When an application has multiple views organized in a hierarchy, hit view determination module 172 identifies a hit view as the lowest view in the hierarchy which should handle the sub-event. In most circumstances, the hit view is the lowest level view in which an initiating sub-event occurs (i.e., the first sub-event in the sequence of sub-events that form an event or potential event). Once the hit view is identified by the hit view determination module, the hit view typically receives all sub-events related to the same touch or input source for which it was identified as the hit view.
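
As an illustrative sketch of the rule described above (using a hypothetical View type; the traversal actually used by hit view determination module 172 is not spelled out here), the lowest view containing the initiating sub-event's location can be found recursively:

    import CoreGraphics

    final class View {
        let frame: CGRect          // in the parent's coordinate space (root uses screen space)
        let subviews: [View]
        init(frame: CGRect, subviews: [View] = []) {
            self.frame = frame
            self.subviews = subviews
        }
    }

    // Returns the deepest view containing `point`, or nil if no view contains it.
    // `point` is expressed in the same coordinate space as `view.frame`.
    func hitView(in view: View, at point: CGPoint) -> View? {
        guard view.frame.contains(point) else { return nil }
        // Convert the point into the view's own coordinate space for its subviews.
        let local = CGPoint(x: point.x - view.frame.origin.x,
                            y: point.y - view.frame.origin.y)
        for subview in view.subviews.reversed() {   // front-most subviews first
            if let hit = hitView(in: subview, at: local) {
                return hit
            }
        }
        return view
    }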

Active event recognizer determination module 173 determines which view or views within a view hierarchy should receive a particular sequence of sub-events. In some embodiments, active event recognizer determination module 173 determines that only the hit view should receive a particular sequence of sub-events. In other embodiments, active event recognizer determination module 173 determines that all views that include the physical location of a sub-event are actively involved views, and therefore determines that all actively involved views should receive a particular sequence of sub-events. In other embodiments, even if touch sub-events were entirely confined to the area associated with one particular view, views higher in the hierarchy would still remain as actively involved views.
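
Continuing the same hypothetical sketch (reusing the View type defined above), the policy in which every view containing the sub-event's physical location is actively involved would return the hit view together with its ancestors along the hit path:

    // Deepest view first, followed by each ancestor on the path to the root.
    func activelyInvolvedViews(in root: View, at point: CGPoint) -> [View] {
        guard root.frame.contains(point) else { return [] }
        let local = CGPoint(x: point.x - root.frame.origin.x,
                            y: point.y - root.frame.origin.y)
        for subview in root.subviews.reversed() {
            let deeper = activelyInvolvedViews(in: subview, at: local)
            if !deeper.isEmpty {
                return deeper + [root]
            }
        }
        return [root]
    }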

Event dispatcher module 174 dispatches the event information to an event recognizer (e.g., event recognizer 180). In embodiments including active event recognizer determination module 173, event dispatcher module 174 delivers the event information to an event recognizer determined by active event recognizer determination module 173. In some embodiments, event dispatcher module 174 stores the event information in an event queue, from which it is retrieved by a respective event receiver module 182.

In some embodiments, operating system 126 includes event sorter 170. Alternatively, application 136-1 includes event sorter 170. In yet other embodiments, event sorter 170 is a stand-alone module, or a part of another module stored in memory 102, such as contact/motion module 130.

In some embodiments, application 136-1 includes a plurality of event handlers 190 and one or more application views 191, each of which includes instructions for handling touch events that occur within a respective view of the application's user interface. Each application view 191 of the application 136-1 includes one or more event recognizers 180. Typically, a respective application view 191 includes a plurality of event recognizers 180. In other embodiments, one or more of event recognizers 180 are part of a separate module, such as a user interface kit (not shown) or a higher level object from which application 136-1 inherits methods and other properties. In some embodiments, a respective event handler 190 includes one or more of: data updater 176, object updater 177, GUI updater 178, and/or event data 179 received from event sorter 170. Event handler 190 optionally utilizes or calls data updater 176, object updater 177 or GUI updater 178 to update the application internal state 192. Alternatively, one or more of the application views 191 includes one or more respective event handlers 190. Also, in some embodiments, one or more of data updater 176, object updater 177, and GUI updater 178 are included in a respective application view 191.

A respective event recognizer 180 receives event information (e.g., event data 179) from event sorter 170, and identifies an event from the event information. Event recognizer 180 includes event receiver 182 and event comparator 184. In some embodiments, event recognizer 180 also includes at least a subset of: metadata 183, and event delivery instructions 188 (which optionally include sub-event delivery instructions).

Event receiver 182 receives event information from event sorter 170. The event information includes information about a sub-event, for example, a touch or a touch movement. Depending on the sub-event, the event information also includes additional information, such as location of the sub-event. When the sub-event concerns motion of a touch, the event information optionally also includes speed and direction of the sub-event. In some embodiments, events include rotation of the device from one orientation to another (e.g., from a portrait orientation to a landscape orientation, or vice versa), and the event information includes corresponding information about the current orientation (also called device attitude) of the device.

Event comparator 184 compares the event information to predefined event or sub-event definitions and, based on the comparison, determines an event or sub-event, or determines or updates the state of an event or sub-event. In some embodiments, event comparator 184 includes event definitions 186. Event definitions 186 contain definitions of events (e.g., predefined sequences of sub-events), for example, event 1 (187-1), event 2 (187-2), and others. In some embodiments, sub-events in an event 187 include, for example, touch begin, touch end, touch movement, touch cancellation, and multiple touching. In one example, the definition for event 1 (187-1) is a double tap on a displayed object. The double tap, for example, comprises a first touch (touch begin) on the displayed object for a predetermined phase, a first lift-off (touch end) for a predetermined phase, a second touch (touch begin) on the displayed object for a predetermined phase, and a second lift-off (touch end) for a predetermined phase. In another example, the definition for event 2 (187-2) is a dragging on a displayed object. The dragging, for example, comprises a touch (or contact) on the displayed object for a predetermined phase, a movement of the touch across touch-sensitive display 112, and lift-off of the touch (touch end). In some embodiments, the event also includes information for one or more associated event handlers 190.
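
By way of illustration only, the following Python sketch shows how a predefined sub-event sequence such as the double-tap definition above might be matched; the class and constant names, the 0.3-second per-phase limit, and the SubEvent fields are assumptions made for the example, not elements of the disclosure.

```python
from dataclasses import dataclass
from typing import Optional

# Illustrative sub-event labels; the disclosure lists touch begin, touch end,
# touch movement, touch cancellation, and multiple touching as examples.
TOUCH_BEGIN, TOUCH_END = "touch_begin", "touch_end"

# Event 1 (187-1): a double tap is begin, end, begin, end, with each phase
# completed within a predetermined time (here an assumed 0.3 s per phase).
DOUBLE_TAP_SEQUENCE = (TOUCH_BEGIN, TOUCH_END, TOUCH_BEGIN, TOUCH_END)


@dataclass
class SubEvent:
    kind: str          # e.g., TOUCH_BEGIN or TOUCH_END
    timestamp: float   # seconds
    target_id: int     # displayed object determined by the hit test


class DoubleTapRecognizer:
    """Minimal analogue of an event recognizer comparing incoming sub-events
    against a predefined sequence (cf. event definitions 186)."""

    def __init__(self, max_phase: float = 0.3) -> None:
        self.max_phase = max_phase  # assumed "predetermined phase" duration
        self.index = 0              # position within the expected sequence
        self.last_time: Optional[float] = None
        self.failed = False         # analogue of the "event failed" state

    def receive(self, sub_event: SubEvent) -> bool:
        """Returns True exactly when the full double-tap sequence completes."""
        if self.failed or self.index >= len(DOUBLE_TAP_SEQUENCE):
            return False            # disregard subsequent sub-events
        in_time = (self.last_time is None or
                   sub_event.timestamp - self.last_time <= self.max_phase)
        if sub_event.kind == DOUBLE_TAP_SEQUENCE[self.index] and in_time:
            self.index += 1
            self.last_time = sub_event.timestamp
            return self.index == len(DOUBLE_TAP_SEQUENCE)
        self.failed = True          # sequence broken: stop tracking this gesture
        return False
```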

In some embodiments, event definition 187 includes a definition of an event for a respective user-interface object. In some embodiments, event comparator 184 performs a hit test to determine which user-interface object is associated with a sub-event. For example, in an application view in which three user-interface objects are displayed on touch-sensitive display 112, when a touch is detected on touch-sensitive display 112, event comparator 184 performs a hit test to determine which of the three user-interface objects is associated with the touch (sub-event). If each displayed object is associated with a respective event handler 190, the event comparator uses the result of the hit test to determine which event handler 190 should be activated. For example, event comparator 184 selects an event handler associated with the sub-event and the object triggering the hit test.

In some embodiments, the definition for a respective event 187 also includes delayed actions that delay delivery of the event information until after it has been determined whether the sequence of sub-events does or does not correspond to the event recognizer's event type.

When a respective event recognizer 180 determines that the series of sub-events does not match any of the events in event definitions 186, the respective event recognizer 180 enters an event impossible, event failed, or event ended state, after which it disregards subsequent sub-events of the touch-based gesture. In this situation, other event recognizers, if any, that remain active for the hit view continue to track and process sub-events of an ongoing touch-based gesture.

In some embodiments, a respective event recognizer 180 includes metadata 183 with configurable properties, flags, and/or lists that indicate how the event delivery system should perform sub-event delivery to actively involved event recognizers. In some embodiments, metadata 183 includes configurable properties, flags, and/or lists that indicate how event recognizers interact, or are enabled to interact, with one another. In some embodiments, metadata 183 includes configurable properties, flags, and/or lists that indicate whether sub-events are delivered to varying levels in the view or programmatic hierarchy.

In some embodiments, a respective event recognizer 180 activates event handler 190 associated with an event when one or more particular sub-events of an event are recognized. In some embodiments, a respective event recognizer 180 delivers event information associated with the event to event handler 190. Activating an event handler 190 is distinct from sending (and deferred sending) sub-events to a respective hit view. In some embodiments, event recognizer 180 throws a flag associated with the recognized event, and event handler 190 associated with the flag catches the flag and performs a predefined process.

In some embodiments, event delivery instructions 188 include sub-event delivery instructions that deliver event information about a sub-event without activating an event handler. Instead, the sub-event delivery instructions deliver event information to event handlers associated with the series of sub-events or to actively involved views. Event handlers associated with the series of sub-events or with actively involved views receive the event information and perform a predetermined process.

In some embodiments, data updater 176 creates and updates data used in application 136-1. For example, data updater 176 updates the telephone number used in contacts module 137, or stores a video file used in video player module 145. In some embodiments, object updater 177 creates and updates objects used in application 136-1. For example, object updater 177 creates a new user-interface object or updates the position of a user-interface object. GUI updater 178 updates the GUI. For example, GUI updater 178 prepares display information and sends it to graphics module 132 for display on a touch-sensitive display.

In some embodiments, event handler(s) 190 includes or has access to data updater 176, object updater 177, and GUI updater 178. In some embodiments, data updater 176, object updater 177, and GUI updater 178 are included in a single module of a respective application 136-1 or application view 191. In other embodiments, they are included in two or more software modules.

It shall be understood that the foregoing discussion regarding event handling of user touches on touch-sensitive displays and/or touchpads also applies to other forms of user inputs to operate multifunction devices 100 with input devices, not all of which are initiated on touch screens. For example, mouse movement and mouse button presses, optionally coordinated with single or multiple keyboard presses or holds; contact movements such as taps, drags, scrolls, etc., on touchpads; pen stylus inputs; movement of the device; oral instructions; detected eye movements; biometric inputs; and/or any combination thereof are optionally utilized as inputs corresponding to sub-events which define an event to be recognized.

FIG. 2 illustrates a portable or non-portable multifunction device 100 having a touch screen 112 in accordance with some embodiments. As stated above, multifunction device 100 is described as having the various illustrated structures (such as touch screen 112, speaker 111, accelerometer 168, microphone 113, etc.); however, it is understood that these structures optionally reside on separate devices. For example, display-related structures (e.g., display, speaker, etc.) and/or functions optionally reside on a separate display device, input-related structures (e.g., touch-sensitive surface, microphone, accelerometer, etc.) and/or functions optionally reside on a separate input device, and remaining structures and/or functions optionally reside on multifunction device 100.

The touch screen 112 optionally displays one or more graphics within user interface (UI) 200. In this embodiment, as well as others described below, a user is enabled to select one or more of the graphics by making a gesture on the graphics, for example, with one or more fingers 202 (not drawn to scale in the figure) or one or more styluses 203 (not drawn to scale in the figure). In some embodiments, selection of one or more graphics occurs when the user breaks contact with the one or more graphics. In some embodiments, the gesture optionally includes one or more taps, one or more swipes (from left to right, right to left, upward and/or downward) and/or a rolling of a finger (from right to left, left to right, upward and/or downward) that has made contact with device 100. In some implementations or circumstances, inadvertent contact with a graphic does not select the graphic. For example, a swipe gesture that sweeps over an application icon optionally does not select the corresponding application when the gesture corresponding to selection is a tap.

Device 100 optionally also includes one or more physical buttons, such as “home” or menu button 204. As described previously, menu button 204 is, optionally, used to navigate to any application 136 in a set of applications that are, optionally, executed on device 100. Alternatively, in some embodiments, the menu button is implemented as a soft key in a GUI displayed on touch screen 112.

In one embodiment, device 100 includes touch screen 112, menu button 204, push button 206 for powering the device on/off and locking the device, volume adjustment button(s) 208, Subscriber Identity Module (SIM) card slot 210, headset jack 212, and docking/charging external port 124. Push button 206 is, optionally, used to turn the power on/off on the device by depressing the button and holding the button in the depressed state for a predefined time interval; to lock the device by depressing the button and releasing the button before the predefined time interval has elapsed; and/or to unlock the device or initiate an unlock process. In an alternative embodiment, device 100 also accepts verbal input for activation or deactivation of some functions through microphone 113. Device 100 also, optionally, includes one or more contact intensity sensors 165 for detecting intensity of contacts on touch screen 112 and/or one or more tactile output generators 167 for generating tactile outputs for a user of device 100.

FIG. 3 is a block diagram of an exemplary multifunction device with a display and a touch-sensitive surface in accordance with some embodiments. Device 300 need not include the display and the touch-sensitive surface, as described above, but rather, in some embodiments, optionally communicates with the display and the touch-sensitive surface on other devices. Additionally, device 300 need not be portable. In some embodiments, device 300 is a laptop computer, a desktop computer, a tablet computer, a multimedia player device (such as a television or a set-top box), a navigation device, an educational device (such as a child's learning toy), a gaming system, or a control device (e.g., a home or industrial controller). Device 300 typically includes one or more processing units (CPU's) 310, one or more network or other communications interfaces 360, memory 370, and one or more communication buses 320 for interconnecting these components. Communication buses 320 optionally include circuitry (sometimes called a chipset) that interconnects and controls communications between system components. Device 300 includes input/output (I/O) interface 330 comprising display 340, which is typically a touch screen display. I/O interface 330 also optionally includes a keyboard and/or mouse (or other pointing device) 350 and touchpad 355, tactile output generator 357 for generating tactile outputs on device 300 (e.g., similar to tactile output generator(s) 167 described above with reference to FIG. 1A), and sensors 359 (e.g., optical, acceleration, proximity, touch-sensitive, and/or contact intensity sensors similar to contact intensity sensor(s) 165 described above with reference to FIG. 1A). Memory 370 includes high-speed random access memory, such as DRAM, SRAM, DDR RAM or other random access solid state memory devices; and optionally includes non-volatile memory, such as one or more magnetic disk storage devices, optical disk storage devices, flash memory devices, or other non-volatile solid state storage devices. Memory 370 optionally includes one or more storage devices remotely located from CPU(s) 310. In some embodiments, memory 370 stores programs, modules, and data structures analogous to the programs, modules, and data structures stored in memory 102 of portable or non-portable multifunction device 100 (FIG. 1A), or a subset thereof. Furthermore, memory 370 optionally stores additional programs, modules, and data structures not present in memory 102 of portable or non-portable multifunction device 100. For example, memory 370 of device 300 optionally stores drawing module 380, presentation module 382, word processing module 384, website creation module 386, disk authoring module 388, and/or spreadsheet module 390, while memory 102 of portable or non-portable multifunction device 100 (FIG. 1A) optionally does not store these modules.

Each of the above identified elements in FIG. 3 is, optionally, stored in one or more of the previously mentioned memory devices. Each of the above identified modules corresponds to a set of instructions for performing a function described above. The above identified modules or programs (i.e., sets of instructions) need not be implemented as separate software programs, procedures or modules, and thus various subsets of these modules are, optionally, combined or otherwise re-arranged in various embodiments. In some embodiments, memory 370 optionally stores a subset of the modules and data structures identified above. Furthermore, memory 370 optionally stores additional modules and data structures not described above.

FIG. 4 illustrates an exemplary user interface on a device (e.g., device 300, FIG. 3) with a touch-sensitive surface 451 (e.g., a tablet or touchpad 355, FIG. 3) that is separate from the display 450 (e.g., touch screen display 112). Device 300 also, optionally, includes one or more contact intensity sensors (e.g., one or more of sensors 359) for detecting intensity of contacts on touch-sensitive surface 451 and/or one or more tactile output generators 357 for generating tactile outputs for a user of device 300.

Although some of the examples which follow will be given with reference to inputs on touch screen display 112 (where the touch-sensitive surface and the display are combined), in some embodiments, the device detects inputs on a touch-sensitive surface that is separate from the display, as shown in FIG. 4. In some embodiments, the touch-sensitive surface (e.g., 451 in FIG. 4) has a primary axis (e.g., 452 in FIG. 4) that corresponds to a primary axis (e.g., 453 in FIG. 4) on the display (e.g., 450). In accordance with these embodiments, the device detects contacts (e.g., 460 and 462 in FIG. 4) with the touch-sensitive surface 451 at locations that correspond to respective locations on the display (e.g., in FIG. 4, 460 corresponds to 468 and 462 corresponds to 470). In this way, user inputs (e.g., contacts 460 and 462, and movements thereof) detected by the device on the touch-sensitive surface (e.g., 451 in FIG. 4) are used by the device to manipulate the user interface on the display (e.g., 450 in FIG. 4) of the multifunction device when the touch-sensitive surface is separate from the display. It should be understood that similar methods are, optionally, used for other user interfaces described herein.

Additionally, while the following examples are given primarily with reference to finger inputs (e.g., finger contacts, finger tap gestures, finger swipe gestures), it should be understood that, in some embodiments, one or more of the finger inputs are replaced with input from another input device (e.g., a mouse-based input or stylus input). For example, a swipe gesture is, optionally, replaced with a mouse click (e.g., instead of a contact) followed by movement of the cursor along the path of the swipe (e.g., instead of movement of the contact). As another example, a tap gesture is, optionally, replaced with a mouse click while the cursor is located over the location of the tap gesture (e.g., instead of detection of the contact followed by ceasing to detect the contact). Similarly, when multiple user inputs are simultaneously detected, it should be understood that multiple computer mice are, optionally, used simultaneously, or a mouse and finger contacts are, optionally, used simultaneously.

As used herein, the term “focus selector” refers to an input element that indicates a current part of a user interface with which a user is interacting. In some implementations that include a cursor or other location marker, the cursor acts as a “focus selector,” so that when an input (e.g., a press input) is detected on a touch-sensitive surface (e.g., touchpad 355 in FIG. 3 or touch-sensitive surface 451 in FIG. 4) while the cursor is over a particular user interface element (e.g., a button, window, slider or other user interface element), the particular user interface element is adjusted in accordance with the detected input. In some implementations that include a touch-screen display (e.g., touch-sensitive display system 112 in FIG. 1A or touch screen 112 in FIG. 2) that enables direct interaction with user interface elements on the touch-screen display, a detected contact on the touch-screen acts as a “focus selector,” so that when an input (e.g., a press input by the contact) is detected on the touch-screen display at a location of a particular user interface element (e.g., a button, window, slider or other user interface element), the particular user interface element is adjusted in accordance with the detected input. In some implementations, focus is moved from one region of a user interface to another region of the user interface without corresponding movement of a cursor or movement of a contact on a touch-screen display (e.g., by using a tab key or arrow keys to move focus from one button to another button); in these implementations, the focus selector moves in accordance with movement of focus between different regions of the user interface. Without regard to the specific form taken by the focus selector, the focus selector is generally the user interface element (or contact on a touch-screen display) that is controlled by the user so as to communicate the user's intended interaction with the user interface (e.g., by indicating, to the device, the element of the user interface with which the user is intending to interact). For example, the location of a focus selector (e.g., a cursor, a contact or a selection box) over a respective button while a press input is detected on the touch-sensitive surface (e.g., a touchpad or touch screen) will indicate that the user is intending to activate the respective button (as opposed to other user interface elements shown on a display of the device).

In some embodiments described herein, one or more operations are performed in response to detecting a gesture that includes a respective press input or in response to detecting the respective press input performed with a respective contact (or a plurality of contacts), where the respective press input is detected based at least in part on detecting an increase in intensity of the contact (or plurality of contacts) above a press-input intensity threshold. In some embodiments, the respective operation is performed in response to detecting the increase in intensity of the respective contact above the press-input intensity threshold (e.g., a “down stroke” of the respective press input). In some embodiments, the press input includes an increase in intensity of the respective contact above the press-input intensity threshold and a subsequent decrease in intensity of the contact below the press-input intensity threshold, and the respective operation is performed in response to detecting the subsequent decrease in intensity of the respective contact below the press-input threshold (e.g., an “up stroke” of the respective press input).

In some embodiments, the device employs intensity hysteresis to avoid accidental inputs sometimes termed “jitter,” where the device defines or selects a hysteresis intensity threshold with a predefined relationship to the press-input intensity threshold (e.g., the hysteresis intensity threshold is X intensity units lower than the press-input intensity threshold or the hysteresis intensity threshold is 75%, 90% or some reasonable proportion of the press-input intensity threshold). Thus, in some embodiments, the press input includes an increase in intensity of the respective contact above the press-input intensity threshold and a subsequent decrease in intensity of the contact below the hysteresis intensity threshold that corresponds to the press-input intensity threshold, and the respective operation is performed in response to detecting the subsequent decrease in intensity of the respective contact below the hysteresis intensity threshold (e.g., an “up stroke” of the respective press input). Similarly, in some embodiments, the press input is detected only when the device detects an increase in intensity of the contact from an intensity at or below the hysteresis intensity threshold to an intensity at or above the press-input intensity threshold and, optionally, a subsequent decrease in intensity of the contact to an intensity at or below the hysteresis intensity, and the respective operation is performed in response to detecting the press input (e.g., the increase in intensity of the contact or the decrease in intensity of the contact, depending on the circumstances).
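
As a rough illustration of the hysteresis behavior described above, the following sketch tracks a press using a press-input threshold and a lower hysteresis threshold; the class name, numeric thresholds, and the 75% ratio chosen here are illustrative assumptions, not values taken from the disclosure.

```python
from typing import Optional


class PressDetector:
    """Press detection with a press-input threshold and a lower hysteresis
    threshold to suppress jitter; threshold values here are illustrative."""

    def __init__(self, press_threshold: float = 1.0,
                 hysteresis_ratio: float = 0.75) -> None:
        self.press_threshold = press_threshold
        # Hysteresis threshold defined as a proportion of the press threshold.
        self.hysteresis_threshold = press_threshold * hysteresis_ratio
        self.pressed = False

    def update(self, intensity: float) -> Optional[str]:
        """Returns "down_stroke" or "up_stroke" when a transition occurs."""
        if not self.pressed and intensity >= self.press_threshold:
            self.pressed = True
            return "down_stroke"
        # End the press only once intensity falls below the hysteresis
        # threshold, so brief dips below the press threshold are ignored.
        if self.pressed and intensity <= self.hysteresis_threshold:
            self.pressed = False
            return "up_stroke"
        return None
```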

For ease of explanation, the operations described as performed in response to a press input associated with a press-input intensity threshold or in response to a gesture including the press input are, optionally, triggered in response to detecting any of: an increase in intensity of a contact above the press-input intensity threshold, an increase in intensity of a contact from an intensity below the hysteresis intensity threshold to an intensity above the press-input intensity threshold, a decrease in intensity of the contact below the press-input intensity threshold, and/or a decrease in intensity of the contact below the hysteresis intensity threshold corresponding to the press-input intensity threshold. Additionally, in examples where an operation is described as being performed in response to detecting a decrease in intensity of a contact below the press-input intensity threshold, the operation is, optionally, performed in response to detecting a decrease in intensity of the contact below a hysteresis intensity threshold corresponding to, and lower than, the press-input intensity threshold.

FIG. 5 illustrates a block diagram of exemplary electronic devices in communication with each other in accordance with some embodiments. First electronic device 500 and second electronic device 506 are optionally any electronic device, such as multifunction devices 100 or 300, as illustrated in FIGS. 1A-B, 2, and 3 (e.g., phones, tablets, laptop computers, set-top boxes, etc.). In some embodiments, the devices 500 and 506 are in communication with each other either directly or over a network 512 (e.g., a local area network, a wide area network such as the Internet, etc.).

In some embodiments, one or more of devices 500 and 506 automatically crop video content captured by cameras 504 and/or 510 for display on displays 502 and/or 508. For example, the first electronic device 500 automatically crops video content captured by camera 504 and sends the cropped video content to the second electronic device 506 for display on the display 508. In another example, the first electronic device 500 receives video content captured by the camera 510 from the second electronic device 506, and the first electronic device 500 displays the cropped video content on the display 502.

User Interfaces and Associated Processes

Electronic devices are often equipped with a camera for capturing video content and/or a display for displaying video content. However, amateur users often capture video content without regard to composition, framing, or camera movement, resulting in video content that can be jarring or confusing to viewers. There is a need to automate the processing and presentation of video content in an aesthetically pleasing manner. The embodiments described below provide a method of automatically cropping video content for presentation on a display.

FIGS. 6A-6LL illustrate exemplary video content automatically cropped in accordance with some embodiments. The video content in these figures is used to illustrate the processes described below, including the processes described with reference to FIGS. 7A-7D.

FIG. 6A illustrates exemplary video content 602 as captured by a camera (e.g., cameras 504 or 510 in FIG. 5) and displayed on a display 600 (e.g., displays 200, 340, 502, or 508 in FIGS. 2-5). The video content 602 includes subject 606 and is cropped according to a cropping 604. The display 600 optionally only displays portions of the video content 602 within the cropping 604 and does not display portions of the video content that are outside the cropping 604.

FIGS. 6B-6G illustrate determining a cropping 604 based on a subject 606 in the video content 602. For example, the cropping is optionally determined so as to keep the subject 606 within a threshold area (discussed in further detail below) in the cropped video content. In some embodiments, the threshold area is optionally defined by the cropping itself. In FIG. 6B, the torso of the subject 606 is within the cropping 604. In FIG. 6C, the torso of the subject 606 has moved outside the cropping 604. In response to movement outside the threshold area, a new cropping is optionally determined, as illustrated in FIGS. 6D and 6E. In FIG. 6D, the cropping 604 is panned with respect to the cropping in FIG. 6C (e.g., the cropping 604 has the same dimensions in both FIGS. 6C and 6D, but the cropping is at a different position in FIG. 6D). In FIG. 6E, the cropping 604 is zoomed out with respect to the cropping in FIG. 6C (e.g., the cropping has increased in size in FIG. 6E compared to FIG. 6C).
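
A minimal sketch of this re-cropping choice is shown below, assuming croppings and subjects are axis-aligned rectangles; the Rect type, the function name, and the preference for panning before zooming out are illustrative assumptions consistent with FIGS. 6D and 6E rather than a definitive implementation.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    x: float
    y: float
    w: float
    h: float

    def contains(self, other: "Rect") -> bool:
        return (self.x <= other.x and self.y <= other.y and
                other.x + other.w <= self.x + self.w and
                other.y + other.h <= self.y + self.h)


def recrop_for_subject(crop: Rect, subject: Rect, frame: Rect) -> Rect:
    """Keep the subject inside the cropping: prefer panning the crop at the
    same size (cf. FIG. 6D); fall back to zooming out (cf. FIG. 6E)."""
    if crop.contains(subject):
        return crop  # subject is still inside the threshold area

    # Pan: shift the crop (same dimensions) just far enough to cover the subject.
    new_x = min(max(crop.x, subject.x + subject.w - crop.w), subject.x)
    new_y = min(max(crop.y, subject.y + subject.h - crop.h), subject.y)
    panned = Rect(new_x, new_y, crop.w, crop.h)
    if panned.contains(subject) and frame.contains(panned):
        return panned

    # Zoom out: grow the crop to the union of the old crop and the subject
    # (aspect-ratio handling and clamping to the frame omitted for brevity).
    x = min(crop.x, subject.x)
    y = min(crop.y, subject.y)
    right = max(crop.x + crop.w, subject.x + subject.w)
    bottom = max(crop.y + crop.h, subject.y + subject.h)
    return Rect(x, y, right - x, bottom - y)
```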

In some embodiments, a threshold area is optionally defined as smaller or larger than the cropping itself. For example, the threshold area is optionally defined by a rectangle surrounding an inner third of the cropping area. In FIG. 6F, the subject 606 enters the inner third threshold area 624 of the cropping 604. In response to the movement of the subject into the threshold area 624, a new cropping is optionally determined, as illustrated in FIG. 6G. FIG. 6G illustrates a cropping 604 that is zoomed in with respect to the cropping in FIG. 6F.

FIGS. 6H-6R illustrate determining a cropping 604 based on a determined range of movement of the subject 606 in the video content 602. For example, a range of movement is optionally determined based on the movement of the subject from FIG. 6H to FIG. 6I. The range of movement and its relationship to a cropping is optionally determined using information about the environment in the field of view of the camera obtained from one or more sensors, and/or information from the video content itself, as will be explained in further detail below. In this example, the range of movement (1) does not fall outside the cropping 604 and (2) spans from one edge of the cropping to the other (e.g., the endpoints of the movement fall within a threshold distance of the edges of the cropping). In response to this determination, the cropping 604 is optionally maintained.

In contrast, the range of movement of the subject 606 from FIG. 6J to FIG. 6K is much shorter than the length of the cropping 604 (e.g., the endpoints of the movement do not fall within the threshold distance of the edges of the cropping). In response to a determination of the range of movement from FIG. 6J to FIG. 6K, a new cropping 604 is optionally determined, as illustrated in FIG. 6L, that corresponds with the smaller range of movement. The cropping in FIG. 6L allows the subject 606 to occupy more of the frame while still keeping the subject in the frame as long as the range of movement is maintained. In FIGS. 6M and 6N, the range of movement of the subject 606 brings the subject outside the cropping 604. In response to determining that the range of movement exceeds the bounds of the cropping 604, a new cropping 604 is optionally determined, as illustrated in FIG. 6O, that is expanded to correspond to the larger range of movement.
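
One possible way to size a cropping to an observed range of movement is sketched below; the function name, the margin, the minimum width, and the use of tracked subject-center positions are assumptions made for the example.

```python
def crop_for_range_of_movement(positions, aspect=16 / 9, margin=0.1, min_w=100.0):
    """Size a cropping (x, y, w, h) to the subject's observed range of movement.

    `positions` is a sequence of tracked (x, y) subject centers over a time
    window; `margin`, `min_w`, and the aspect ratio are illustrative values.
    """
    xs = [p[0] for p in positions]
    ys = [p[1] for p in positions]
    # Bounds of the observed movement, padded on each side by the margin.
    w = max((max(xs) - min(xs)) * (1 + 2 * margin), min_w)
    h = max((max(ys) - min(ys)) * (1 + 2 * margin), min_w / aspect)
    # Expand the shorter dimension so the crop matches the output aspect ratio.
    if w / h > aspect:
        h = w / aspect
    else:
        w = h * aspect
    cx, cy = (max(xs) + min(xs)) / 2, (max(ys) + min(ys)) / 2
    return (cx - w / 2, cy - h / 2, w, h)
```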

In some embodiments, a cropping is optionally determined in response to a determination that the subject has stopped moving. For example, FIG. 6P illustrates a moving subject 606. In FIG. 6Q, the subject 606 has stopped moving. In response to determining that the subject 606 has stopped moving, a new cropping 604 is optionally determined, as illustrated in FIG. 6R, that is zoomed in on the stationary subject 606 with respect to the cropping in FIG. 6Q.

FIGS. 6S-6EE illustrate determining a cropping based on a determination that a subject 606 is active or inactive. Active or inactive status is optionally determined based on the existence of objects identified in the frame, their relationship to each other, their positions or movements within the frame, or other information, as will be discussed in further detail below. In some embodiments, activity status is optionally determined based on a subject using another electronic device, such as a phone.

FIG. 6S illustrates subjects 606 and 608 in video content 602. FIG. 6T illustrates subject 606 using phone 610. In response to determining that the subject 606 is using a phone (e.g., based on information received from the phone 610, or based on image recognition of the phone in the hand of subject 606, etc.), it is optionally determined that the subject 606 is inactive. In accordance with a determination that the subject 606 is inactive, the subject 606 is optionally deemphasized in a new cropping 604 illustrated in FIG. 6U, where the cropping 604 is zoomed in on the subject 608 and the subject 606 is excluded from the cropping.

In FIG. 6V, the subject 606 has stopped using the phone 610. In response to determining that the subject 606 is no longer using a phone (e.g., based on information received from the phone 610, or based on image recognition that the phone is no longer in the hand of subject 606, etc.), it is optionally determined that the subject 606 is active. In accordance with a determination that the subject 606 is active, the subject 606 is optionally emphasized in a new cropping 604 illustrated in FIG. 6W, where the cropping 604 is zoomed out to include both subjects 606 and 608.

In some embodiments, activity status is optionally determined based on gestures, such as hand gestures. FIG. 6X illustrates subject 606 making a first gesture 612 indicating an inactive status (e.g., holding up a single finger). In response to the first gesture 612, the subject 606 is optionally deemphasized in a new cropping 604 illustrated in FIG. 6Y. FIG. 6Z illustrates subject 606 making a second gesture 614 indicating an active status (e.g., waving a hand). In response to the second gesture 614, the subject 606 is optionally emphasized in a new cropping 604 illustrated in FIG. 6AA.

In some embodiments, activity status is optionally determined based on posture of a subject and/or movement of a subject. FIG. 6BB illustrates subject 606 facing away from subject 608, indicating a desire to leave and, thereby, an inactive status. Optionally, subject 606 may be walking away from subject 608, indicating an inactive status. In response to the posture and/or movement of subject 606, the subject 606 is optionally deemphasized in a new cropping 604 illustrated in FIG. 6CC. FIG. 6DD illustrates the subject 606 facing towards the subject 608, indicating engagement with subject 608 and an active status. Optionally, subject 606 may be walking toward the subject 608, indicating an active status. In response to the posture and/or movement of subject 606, the subject 606 is optionally emphasized in a new cropping 604 in FIG. 6EE.

In some embodiments, a transition from first cropped video content to second cropped video content is optionally generated. For example, FIG. 6FF illustrates a first cropping and FIG. 6II illustrates a second cropping. A jump cut from the cropping in FIG. 6FF to the cropping in FIG. 6II might be jarring for a viewer. One or more intervening croppings are optionally generated to simulate a smooth zoom, as illustrated in the sequence of FIGS. 6FF-6II, to create a more aesthetically pleasing viewing experience.

FIGS. 6JJ-6LL illustrate determining a cropping based on a physical size of a display such that the subject is scaled to life-size on the display. For example, the device determines a physical size of the display (e.g., based on metadata about the display), determines a physical size of the subject's face (e.g., based on information from a depth camera), and determines a cropping that will scale the subject's face so that, when it is displayed on the display, the face will have physical dimensions that are approximately the same as those of the subject's actual face.

FIG. 6JJ illustrates video content 602 including subject 606 and cropped according to the cropping 604. The cropped video content is displayed on display 618, and the face of the subject 606 on the display 618 is illustrated as having approximately the same physical dimensions as the subject's actual face. FIGS. 6KK and 6LL illustrate different sized displays 620 and 622 and corresponding croppings. As illustrated in FIGS. 6JJ-6LL, on a smaller display the cropping is closer to the subject's face so that the subject's face takes up more of the display, whereas on a larger display, the cropping is further from the subject's face so that the subject's face takes up less of the display. Thus, in some embodiments, the cropping of the video is based at least in part on a size of the display on which the cropped video is to be displayed. In some embodiments, when the video is switched from a first display to a second display that is a different size from the first display, the cropping changes to account for the different sizes of the first and second displays.
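
The life-size relationship described above reduces, under simple assumptions (a crop that fills the display width and no lens distortion), to one line of arithmetic; the function and variable names below are illustrative.

```python
def life_size_crop_width(face_px_w: float, face_mm_w: float,
                         display_mm_w: float) -> float:
    """Width of the crop, in source pixels, so that when the crop is scaled
    to fill the display, the subject's face appears at its physical size.

    The face spans face_px_w source pixels and must span a fraction
    face_mm_w / display_mm_w of the display width, so the crop must span
    face_px_w * display_mm_w / face_mm_w source pixels.
    """
    return face_px_w * display_mm_w / face_mm_w


# Illustrative check: a 150 mm wide face covering 400 source pixels needs an
# 800-pixel-wide crop on a 300 mm wide display (face fills half the display)
# but a 1600-pixel-wide crop on a 600 mm display (face fills a quarter of it).
```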

FIGS. 7A-7D are flow diagrams illustrating a method for automatically cropping video content in accordance with some embodiments. The method is optionally performed at an electronic device as described above with reference to FIGS. 1A-B and 2-5 (e.g., electronic device 100, 300, 500, or 506, etc.). Optional or alternative operations in FIGS. 7A-7D are indicated in dashed boxes. Some operations in method 700 are, optionally, combined and/or the order of some operations is, optionally, changed.

As described below, the method 700 provides ways of automatically cropping video content. The method reduces the cognitive burden on a user when interacting with a user interface on the device by automatically selecting a desirable cropping presentation for video content so that the user does not need to manually determine an appropriate cropping, thereby creating a more efficient human-machine interface. For battery-operated electronic devices, increasing the efficiency of the user's interaction with the user interfaces conserves power and increases the time between battery charges.

An electronic device 500 (e.g., a set-top box or other device that is optionally in communication with a display device and/or a camera) with one or more processors and memory receives (702) video content (e.g., video content 602) captured by a camera (e.g., camera 504 or 510). The electronic device optionally receives (704) first information about an environment that is in a field of view of the camera from one or more sensors associated with the camera (e.g., a microphone, a depth camera, a thermal camera, a video camera, or other sensors, including sensors of a local device or a remote device separate from the electronic device that are sensing objects and/or activity in the field of view of the camera).

The electronic device determines (710) a first cropping (e.g., cropping 604) for the video content based on one or more of the first information from the one or more sensors and the video content (e.g., image recognition information), and generates (722) first cropped video content for presentation on a display (e.g., a display 508 of an additional device 506 or a display 502 of the electronic device 500) based on the determined first cropping. For example, FIG. 6A illustrates video content 602 cropped according to cropping 604 displayed on display 600.

In some embodiments, the electronic device 500 optionally receives (724) second information about a change in the environment that is in the field of view of the camera from one or more sensors associated with the camera, and/or a change in the video content, determines (734) a second cropping for the video content based on the second information, and generates (750) second cropped video content for presentation on the display based on the determined second cropping.

In some embodiments, the electronic device 500 optionally determines (726), based on the second information and/or changes in the video content, that a subject (e.g., a person such as subject 606 in FIG. 6C) in the environment has moved out of a threshold area of a frame of the video content (e.g., the threshold area is defined by a rectangle surrounding the inner two-thirds of the frame that corresponds to the first cropping of the video content, among other possibilities), and the second cropping is determined (752) so as to keep the subject within the threshold area in the second cropped video content (e.g., as illustrated in FIGS. 6D and 6E, showing new croppings 604 that keep the subject 606 within the cropping). In some embodiments, the second cropping is panned (736) with respect to the first cropping (as illustrated in FIG. 6D). In some embodiments, the second cropping is zoomed out (738) with respect to the first cropping (as illustrated in FIG. 6E).

In some embodiments, the electronic device 500 determines that a subject (e.g., a person such as subject 606 in FIG. 6F) in the environment is within a threshold area of a frame of the video content (e.g., the threshold area is defined by a rectangle surrounding the inner one-third of the frame, among other possibilities), the second cropping is determined in response to determining that the subject in the environment is within the threshold area, and the second cropping is zoomed in with respect to the first cropping (as illustrated in FIG. 6G, showing a new cropping 604 that is zoomed in on subject 606).

In some embodiments, the electronic device 500 determines (728), basedon the second information and/or changes in the video content, that asubject (e.g., a person such as subject 606 in FIG. 6Q) in theenvironment has stopped moving, and the second cropping is determined(740) in response to determining that the subject in the environment hasstopped moving (as illustrated in FIG. 6R, showing a new cropping 604that is zoomed in on the stationary subject 606).

In some embodiments, the electronic device 500 determines (754) that cropping criteria are met (e.g., a subject has moved outside a threshold area or within a threshold area, a subject has stopped moving, etc.). In response to determining that the cropping criteria are met (756), in accordance with a determination that an elapsed time since the first cropping was determined does not exceed a predetermined threshold time period, the electronic device maintains (758) the first cropping (e.g., so that the image is not re-cropped too often). In accordance with a determination that the elapsed time exceeds the predetermined threshold time period, the electronic device provides (760) the second cropped video content for presentation on the display (e.g., once the threshold time period is met, automatically re-crop in response to determining that the cropping criteria are met).
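
A minimal sketch of this rate limiting follows; the three-second interval and the class name are assumptions, since the disclosure leaves the actual predetermined threshold time period unspecified.

```python
import time


class CropRateLimiter:
    """Re-crop only when cropping criteria are met and enough time has passed
    since the previous cropping; the interval used here is an assumed value."""

    def __init__(self, min_interval_s: float = 3.0) -> None:
        self.min_interval_s = min_interval_s
        self.last_crop_time = time.monotonic()

    def maybe_recrop(self, criteria_met: bool, current_crop, proposed_crop):
        if not criteria_met:
            return current_crop
        now = time.monotonic()
        if now - self.last_crop_time <= self.min_interval_s:
            return current_crop   # maintain the first cropping (758)
        self.last_crop_time = now
        return proposed_crop      # provide the second cropping (760)
```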

In some embodiments, the electronic device 500 determines (730), based on the first or second information and/or the video content, whether a subject in the environment is active or inactive (e.g., based on information indicating that the subject 606 is on the phone 610, as illustrated in FIG. 6T; in response to a gesture 612 or 614 with a hand, device, or object, as illustrated in FIGS. 6X and 6Z; or recognizing posture/movement of a subject getting up to leave or returning to the conversation, as illustrated in FIGS. 6BB and 6DD). In accordance with a determination that the subject is active, the electronic device emphasizes (742) (e.g., zooming in on) the subject in the second cropping with respect to the first cropping (e.g., to focus on the active subject 606, as illustrated in FIGS. 6W, 6AA, and 6EE). In accordance with a determination that the subject is inactive, the electronic device deemphasizes (744) the subject in the second cropping with respect to the first cropping (e.g., cropping out the inactive subject 606, as illustrated in FIGS. 6U, 6Y, and 6CC).
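
One simple way to realize the emphasize/deemphasize behavior is to crop to the union of the active subjects' bounding boxes, as sketched below; the box representation, padding value, and fallback to the full frame are assumptions for the example.

```python
def crop_for_active_subjects(subject_boxes, activity, frame, pad=0.15):
    """Crop to the union of the active subjects' boxes, padded; inactive
    subjects (e.g., a subject on a phone or walking away) are cropped out.

    Boxes and `frame` are (x, y, w, h) tuples; the padding is illustrative.
    """
    active = [box for box, is_active in zip(subject_boxes, activity) if is_active]
    if not active:
        return frame  # nobody active: fall back to the full frame
    x0 = min(b[0] for b in active)
    y0 = min(b[1] for b in active)
    x1 = max(b[0] + b[2] for b in active)
    y1 = max(b[1] + b[3] for b in active)
    px, py = (x1 - x0) * pad, (y1 - y0) * pad
    # Clamping to the frame and matching the output aspect ratio are omitted.
    return (x0 - px, y0 - py, (x1 - x0) + 2 * px, (y1 - y0) + 2 * py)
```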

In some embodiments, the electronic device 500 determines (732), based on the second information, a range of movement of a subject in the environment (e.g., determine bounds of the range of movement of the subject and/or determine if the movement is outside the first cropping for the video content, as illustrated in FIGS. 6H-6R). The second cropping is optionally zoomed out (746) with respect to the first cropping (e.g., zoomed out to fit a relatively large range of movement, as illustrated in FIG. 6O) in accordance with a determination that the range of movement of the subject is larger than a predetermined threshold (e.g., a threshold determined based on the first cropping for the video content, such as the cropping in FIGS. 6M and 6N). The second cropping is optionally zoomed in (748) with respect to the first cropping (e.g., zoomed in to fit a relatively small range of movement, as illustrated in FIG. 6L) in accordance with a determination that the range of movement of the subject is smaller than the predetermined threshold (e.g., as illustrated in FIGS. 6J and 6K).

In some embodiments, the electronic device 500 generates (762) a transition from the first cropped video content to the second cropped video content (e.g., generating one or more intervening croppings between the first cropping and the second cropping to simulate a smooth zoom and/or pan, as illustrated in FIGS. 6FF-6II).
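
An illustrative way to generate such intervening croppings is to interpolate the crop rectangles with an easing curve, as in the sketch below; the step count and smoothstep easing are assumptions, not parameters taken from the disclosure.

```python
def transition_crops(first, second, steps=24):
    """Intervening croppings between two (x, y, w, h) crops, eased with a
    smoothstep curve so the simulated pan/zoom starts and ends gently."""
    crops = []
    for i in range(1, steps + 1):
        t = i / steps
        t = t * t * (3 - 2 * t)  # smoothstep easing
        crops.append(tuple(a + (b - a) * t for a, b in zip(first, second)))
    return crops
```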

In some embodiments, the display has one or more characteristics (e.g., a physical size, a resolution, a color depth, or other display characteristics) and determining the first cropping is based on the one or more characteristics of the display (712). In some embodiments, determining the first cropping includes scaling (714) a subject in the environment to life-size on the display. For example, the device determines a physical size of the display, determines a physical size of the subject's face, and determines a cropping that will scale the subject's face so that, when it is displayed on the display, the face will have physical dimensions that are approximately the same as those of the subject's actual face. Thus, on a smaller display the cropping would be closer to the subject's face so that the subject's face takes up more of the display, whereas on a larger display, the cropping would be further from the subject's face so that the subject's face takes up less of the display, as illustrated in FIGS. 6JJ-6LL. Thus, in some embodiments, the cropping of the video is based at least in part on a size of the display on which the cropped video is to be displayed. In some embodiments, when the video is switched from a first display to a second display that is a different size from the first display, the cropping changes to account for the different sizes of the first and second displays.

In some embodiments, the electronic device 500 determines (716) respective locations of a plurality of subjects in a frame of the video content, wherein the first cropping is determined based on the respective locations (e.g., so that the plurality of the subjects are within the cropped frame, as illustrated in FIGS. 6S, 6W, 6AA, and 6EE).

In some embodiments, the electronic device 500 selects (706) a cropping style (e.g., based on user input or automatically), and the first cropping is determined based on the selected cropping style (718). For example, a style optionally specifies one or more of a frequency of re-determining croppings, a speed of transition between croppings, or a tightness of zoom on subjects, etc. In some embodiments, the cropping style is automatically selected (708) based on cropping style criteria. For example, if subjects move out of the frame more than a threshold number of times during a predetermined time period (e.g., more than three times in a minute), then a new cropping style is optionally selected that does not zoom in as tightly on subjects as in a previously selected cropping style. This newly selected cropping style thereby results in less frequent re-determinations of croppings, avoiding user confusion and discomfort.
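
A sketch of this automatic style selection is shown below; the one-minute window and three-exit limit mirror the example above, while the style fields and the specific numeric values for zoom tightness and re-crop interval are invented placeholders.

```python
def select_cropping_style(exit_times, now, window_s=60.0, max_exits=3):
    """If subjects left the frame more than max_exits times in the last
    window, use a looser style that zooms less tightly and re-crops less
    often; the style fields and numeric values are illustrative only."""
    recent = [t for t in exit_times if now - t <= window_s]
    if len(recent) > max_exits:
        return {"zoom_tightness": 0.6, "recrop_interval_s": 6.0}
    return {"zoom_tightness": 0.9, "recrop_interval_s": 3.0}
```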

In some embodiments, determining the first cropping is further based on metadata (720) of the video content (e.g., resolution of the video content, field of view of the video camera). For example, if the resolution of the video content is relatively high, then more of the content can be cropped out without sacrificing image quality, whereas if the resolution of the video content is relatively low, the content cannot be too tightly cropped without resulting in an image that is of poor quality due to low resolution. In some embodiments, the first cropping includes an entire frame of the video content (e.g., nothing is cropped out of the frame), and the electronic device 500 downsamples (764) the first cropped video content to match a resolution of the display. In some embodiments, the video content has a much higher resolution (e.g., 2×, 4×, 8×) than the display to enable the video content to be cropped at a number of different scale factors while still displaying video at native resolution. Thus, in some circumstances, the video content at the first cropping and the video content at the second cropping are both output at the same resolution (e.g., the resolution of the display), because the video content is downsampled at both the first cropping and the second cropping to the resolution of the display.
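
The resolution constraint described here can be expressed as a small clamp on the crop size, sketched below under the assumption that upscaling is never desired; the function name and the example resolutions are illustrative.

```python
def clamp_crop_to_native(crop_w, crop_h, display_w, display_h):
    """Never crop below the display resolution, so the cropped region is
    downsampled (or shown 1:1) rather than upscaled on the display."""
    scale = max(display_w / crop_w, display_h / crop_h, 1.0)
    return crop_w * scale, crop_h * scale


# Example: with source video at 4x the linear resolution of a 1920x1080
# display (7680x4320), any crop between 1920x1080 and the full frame can be
# downsampled to 1920x1080 for output without ever upscaling.
```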

In some embodiments, the electronic device 500 connects (766) a video call between the electronic device 500 and an additional electronic device 506. In some embodiments, two different devices (e.g., devices 500 and 506) are sharing video content in a two-way video chat, and the video captured by a camera 504 of the first device 500 is being panned and/or cropped for the display 508 of the second device 506 while the video content captured by a camera 510 of the second device 506 is being panned and/or cropped for the display 502 of the first device 500. In some embodiments, the video content from the camera 504 of the first device 500 is being downsampled, panned and/or cropped at the first device 500 before being sent to the second device 506 (e.g., so as to conserve bandwidth) and the video content from the camera 510 of the second device 506 is being downsampled, panned and/or cropped at the second device 506 before being sent to the first device 500 (e.g., so as to conserve bandwidth). In such a situation, the first device 500 would optionally provide the second device 506 with information about a size and/or resolution of its display 502 so that the second device 506 could take that information into account when downsampling, panning, and/or cropping the video content sent to the first device 500, and vice versa. In some embodiments, the video content from the camera 504 of the first device 500 is being downsampled, panned and/or cropped at the second device 506 after being received from the first device 500 (e.g., so as to improve response time) and the video content from the camera 510 of the second device 506 is being downsampled, panned and/or cropped at the first device 500 after being received from the second device 506 (e.g., so as to improve response time).

In some embodiments, the camera 504 that captures the video content is connected (768) to the electronic device 500 that generates the first cropped video content, and the display 508 is connected to the additional electronic device 506. In some embodiments, the camera 510 that captures the video content is connected (770) to the additional electronic device 506, and the display 502 is connected to the electronic device 500 that generates the first cropped video content.

The operations in the information processing methods described above are, optionally, implemented by running one or more functional modules in an information processing apparatus such as general purpose processors (e.g., as described above with respect to FIGS. 1A and 3) or application specific chips.

The operations described above with reference to FIGS. 7A-7D are, optionally, implemented by components depicted in FIGS. 1A-1B. For example, receiving operation 702, receiving operation 704, determining operation 710, and generating operation 722 are, optionally, implemented by event sorter 170, event recognizer 180, and event handler 190. Event monitor 171 in event sorter 170 detects a contact on touch-sensitive display 112, and event dispatcher module 174 delivers the event information to application 136-1. A respective event recognizer 180 of application 136-1 compares the event information to respective event definitions 186, and determines whether a first contact at a first location on the touch-sensitive surface corresponds to a predefined event or sub-event, such as selection of an object on a user interface. When a respective predefined event or sub-event is detected, event recognizer 180 activates an event handler 190 associated with the detection of the event or sub-event. Event handler 190 optionally utilizes or calls data updater 176 or object updater 177 to update the application internal state 192. In some embodiments, event handler 190 accesses a respective GUI updater 178 to update what is displayed by the application. Similarly, it would be clear to a person having ordinary skill in the art how other processes can be implemented based on the components depicted in FIGS. 1A-1B.

It should be understood that the particular order in which the operations in FIGS. 7A-7D have been described is merely exemplary and is not intended to indicate that the described order is the only order in which the operations could be performed. One of ordinary skill in the art would recognize various ways to reorder the operations described herein.

In accordance with some embodiments, FIG. 8 shows a functional block diagram of a first electronic device 800 configured in accordance with principles of the various described embodiments. The functional blocks of the device are, optionally, implemented by hardware, software, or a combination of hardware and software to carry out the principles of the various described embodiments. It is understood by persons of skill in the art that the functional blocks described in FIG. 8 are, optionally, combined or separated into sub-blocks to implement the principles of the various described embodiments. Therefore, the description herein optionally supports any possible combination or separation or further definition of the functional blocks described herein.

As shown in FIG. 8, an electronic device 800 optionally includes a display unit 802 configured to display cropped video content; a camera unit 804 to capture video content; and a communications unit 806 to send and receive data from additional electronic devices. In some embodiments, the processing unit 808 optionally includes a display enabling unit 810, a receiving unit 812, and a determining unit 814.

In some embodiments, the processing unit 808 is configured to receive (e.g., with the receiving unit 812) video content captured by a camera, receive (e.g., with the receiving unit 812) first information about an environment that is in a field of view of the camera from one or more sensors associated with the camera, determine (e.g., with the determining unit 814) a first cropping for the video content based on the first information from the one or more sensors and the video content, and generate (e.g., with the display enabling unit 810) first cropped video content for presentation on a display based on the determined first cropping.

In some embodiments, the processing unit 808 is further configured to receive (e.g., with the receiving unit 812) second information about a change in the environment that is in the field of view of the camera from one or more sensors associated with the camera, determine (e.g., with the determining unit 814) a second cropping for the video content based on the second information, and generate (e.g., with the display enabling unit 810) second cropped video content for presentation on the display based on the determined second cropping.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814), based on the second information, that a subject in the environment has moved out of a threshold area of a frame of the video content, wherein the second cropping is determined so as to keep the subject within the threshold area in the second cropped video content. In some embodiments, the second cropping is panned with respect to the first cropping. In some embodiments, the second cropping is zoomed out with respect to the first cropping.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814), based on the second information, that a subject in the environment has stopped moving, wherein the second cropping is determined in response to determining that the subject in the environment has stopped moving.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814) that a subject in the environment is within a threshold area of a frame of the video content, wherein the second cropping is determined in response to determining that the subject in the environment is within the threshold area, and the second cropping is zoomed in with respect to the first cropping.

In some embodiments, the first cropping is determined at a first time, and the processing unit 808 is further configured to determine (e.g., with the determining unit 814) that cropping criteria are met, and, in response to determining that the cropping criteria are met, in accordance with a determination that an elapsed time since the first time does not exceed a predetermined threshold time period, maintain (e.g., with the display enabling unit 810) the first cropping, and in accordance with a determination that the elapsed time exceeds the predetermined threshold time period, provide (e.g., with the display enabling unit 810) the second cropped video content for presentation on the display.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814), based on the second information, whether a subject in the environment is active or inactive, in accordance with a determination that the subject is active, emphasize (e.g., with the display enabling unit 810) the subject in the second cropping with respect to the first cropping, and in accordance with a determination that the subject is inactive, deemphasize (e.g., with the display enabling unit 810) the subject in the second cropping with respect to the first cropping.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814), based on the second information, a range of movement of a subject in the environment, wherein the second cropping is zoomed out with respect to the first cropping in accordance with a determination that the range of movement of the subject is larger than a predetermined threshold, and wherein the second cropping is zoomed in with respect to the first cropping in accordance with a determination that the range of movement of the subject is smaller than the predetermined threshold.

In some embodiments, the processing unit 808 is further configured to generate (e.g., with the display enabling unit 810) a transition from the first cropped video content to the second cropped video content.

In some embodiments, the display has one or more characteristics, and determining the first cropping is based on the one or more characteristics of the display. In some embodiments, determining the first cropping includes scaling a subject in the environment to life-size on the display.

In some embodiments, the processing unit 808 is further configured to determine (e.g., with the determining unit 814) respective locations of a plurality of subjects in a frame of the video content, wherein the first cropping is determined based on the respective locations.

In some embodiments, the processing unit 808 is further configured to select (e.g., with the determining unit 814) a cropping style, wherein the first cropping is determined based on the selected cropping style. In some embodiments, the cropping style is automatically selected based on cropping style criteria. In some embodiments, determining the first cropping is further based on metadata of the video content. In some embodiments, the first cropping includes an entire frame of the video content, and the processing unit 808 is further configured to downsample (e.g., with the display enabling unit 810) the first cropped video content to match a resolution of the display.

In some embodiments, the processing unit 808 is further configured to connect (e.g., with the receiving unit 812) a video call between the electronic device and an additional electronic device. In some embodiments, the camera that captures the video content is connected to the electronic device that generates the first cropped video content, and the display is connected to the additional electronic device. In some embodiments, the camera that captures the video content is connected to the additional electronic device, and the display is connected to the electronic device that generates the first cropped video content.

The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best use the invention and various described embodiments with various modifications as are suited to the particular use contemplated.

The invention claimed is:
1. A method comprising: at a first electronic device in communication with a display device and a camera: displaying, via the display device, a first respective portion of video content corresponding to a video call including the first electronic device and a second electronic device, wherein a second portion of the video content is not displayed, the video content captured by the camera in communication with the first electronic device includes a first subject, and the first respective portion of the video content includes the first subject but not a second subject; and while displaying the first respective portion of the video content including the first subject: in accordance with a determination that the second subject in addition to the first subject is active in the video content captured by the camera of the first electronic device, updating the display to display a second respective portion, different from the first respective portion, of the video content captured by the camera of the first electronic device, wherein the second respective portion of the video content includes the first subject and the second subject.
2. The method of claim 1, further comprising: while displaying the second respective portion of the video content including the first subject and the second subject: in accordance with a determination that the second subject is active, maintaining the second subject in the second respective portion of the video content; and in accordance with a determination that the second subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the second subject.
3. The method of claim 1, wherein the determination that the second subject is active is based on image recognition of the video content.
4. The method of claim 1, wherein the determination that the second subject is active is based on a determination that the second subject is engaged with another subject.
5. The method of claim 1, further comprising: during the video call including the first electronic device and the second electronic device, after determining that the second subject is active and including the second subject in the second respective portion of the video content: determining that the first subject is inactive; and in response to determining that the first subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the first subject.
6. The method of claim 1, wherein the determination that the second subject is active is based on movement of the second subject in the video content.
7. A non-transitory computer-readable storage medium including instructions, which when executed by a first electronic device in communication with a display device and a camera, cause the first electronic device to perform a method comprising: displaying, via the display device, a first respective portion of video content corresponding to a video call including the first electronic device and a second electronic device, wherein a second portion of the video content is not displayed, the video content captured by the camera in communication with the first electronic device includes a first subject, and the first respective portion of the video content includes the first subject but not a second subject; and while displaying the first respective portion of the video content including the first subject: in accordance with a determination that the second subject in addition to the first subject is active in the video content captured by the camera of the first electronic device, updating the display to display a second respective portion, different from the first respective portion, of the video content captured by the camera of the first electronic device, wherein the second respective portion of the video content includes the first subject and the second subject.
8. The non-transitory computer-readable storage medium of claim 7, the method further comprising: while displaying the second respective portion of the video content including the first subject and the second subject: in accordance with a determination that the second subject is active, maintaining the second subject in the second respective portion of the video content; and in accordance with a determination that the second subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the second subject.
9. The non-transitory computer-readable storage medium of claim 7, wherein the determination that the second subject is active is based on image recognition of the video content.
10. The non-transitory computer-readable storage medium of claim 7, wherein the determination that the second subject is active is based on a determination that the second subject is engaged with another subject.
11. The non-transitory computer-readable storage medium of claim 7, the method further comprising: during the video call including the first electronic device and the second electronic device, after determining that the second subject is active and including the second subject in the second respective portion of the video content: determining that the first subject is inactive; and in response to determining that the first subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the first subject.
12. The non-transitory computer-readable storage medium of claim 7, wherein the determination that the second subject is active is based on movement of the second subject in the video content.
13. A first electronic device comprising: one or more processors; and memory storing instructions, which when executed by the one or more processors, cause the first electronic device to perform a method comprising: displaying, via a display device, a first respective portion of video content corresponding to a video call including the first electronic device and a second electronic device, wherein a second portion of the video content is not displayed, the video content captured by a camera in communication with the first electronic device includes a first subject, and the first respective portion of the video content includes the first subject but not a second subject; and while displaying the first respective portion of the video content including the first subject: in accordance with a determination that the second subject in addition to the first subject is active in the video content captured by the camera of the first electronic device, updating the display to display a second respective portion, different from the first respective portion, of the video content captured by the camera of the first electronic device, wherein the second respective portion of the video content includes the first subject and the second subject.
14. The first electronic device of claim 13, the method further comprising: while displaying the second respective portion of the video content including the first subject and the second subject: in accordance with a determination that the second subject is active, maintaining the second subject in the second respective portion of the video content; and in accordance with a determination that the second subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the second subject.
15. The first electronic device of claim 13, wherein the determination that the second subject is active is based on image recognition of the video content.
16. The first electronic device of claim 13, wherein the determination that the second subject is active is based on a determination that the second subject is engaged with another subject.
17. The first electronic device of claim 13, the method further comprising: during the video call including the first electronic device and the second electronic device, after determining that the second subject is active and including the second subject in the second respective portion of the video content: determining that the first subject is inactive; and in response to determining that the first subject is inactive, updating the display to display a third respective portion of the video content, wherein the third respective portion of the video content does not include the first subject.
18. The first electronic device of claim 13, wherein the determination that the second subject is active is based on movement of the second subject in the video content.